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SEQ 
ID 
NO: 


Predicced 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
(AsAlanine, C=Cysteine, D-Aspartic Acid, 
Glutamic Acid, F-Phenyl alanine, G=01ycine, 
H«HiQtidine, I^Xsolcucine, K^Lysine, 
L=Leucine, M=rMethionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, v=Valine, 
I7=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\opossible nucleotide insertion) 








ISDYFERRVBQPLyGLDGSAAKEATEEQSAl,?TLMSVMLAKPRL 

DTEQLAQRGAGLCPTFVSAQQNSPSSTGSGNTEHSCSSQKQISI 

QHRQT\QSDbTIEKISALENSKNSDLEKKEGRlDDLLRANCDLR 

RQI\DEQQKMI.EKYK\ERLNRCFDNEPRNFLIEKSKQEKMACRD 
KSMOr)PTil?Tif5WT7'l"P\/DUf2Bon'nr»T?rMi7rnr>^vn T^rwn- tv^aa.— 
Ai3F»wi-'Ku«-U'JXir 1 1 VKHLjAdFrfiQWTDGyAPQNLIKQQERINSQ 

REEIERQRKMLAKRKPPAMGQAPPATNEQKQRKSKTNGAENBTL 

TLAEYHEQEEIFKLRLGHLKKEEAEIQAELBRLERVRNLHIREb 

KRIHNEDNSQFKDHPTLNDRYLLLHLLGRGGFSEVYKAPDLTEQ 

RYVAVKIHQLNKNWRDEiCKENYHKHACREYRIHKEriDHPRIVKL 

u£>r ux V JjJK X CEXjNDLDFYIjKQHKLMSEKEARSIIMQ 

IVNALKYLNEIKPPIlHYDLKPGlSriIiLVNGTACGEIKlTDTOLS 

KIMDDDSYNSVDGMELTSQGAGTYWYLPPECFWGKEPPKISNK 

VDVWSVGVIFYQCLYGRKPFGHNQSQQDILQENTILKATEVQFP 

PKPWTPEAKAFIRRCLAYRKBDRIDVQQLACDPYLLPHIRKSV 


S996 


1612 


981 


DQOACULGLMLTXJEPGILEFDPSWIGSWTQRy'SWVSWRSRPGCE 
LFSIWFGSIVNEGYLNSASEGEEFCIYNRNPNACSYGVAVGVL 
AFLTCLLYLALDVYFPQISSVKDRKK\AVriSGHPWSGEPHPAA 
FffAFLWFTGDSCYL\AWQWQVSKPKDNPLNEGTDASPGRPSPFS 
FFS I FTWS LTAAXjAVRRFKDXjS FQE EYSTL FP \ ASAQP 


5997 
5998 


1612 


981 


dqqacllgi.mltlepgilepdpswigswtqr;swvswrsrpgce 
lfsiwfgsivnegylnsasegeepciynrnpnacsyovavgvl 
afltcllylaldvyppqissvkdrkk\avlsghpwsgbphpaa 
fwaflwftgdscvl\anqwqvskpkdnplnegtdaspgrpspfs 
ppsiftwsltaalavrrfkdlsfqeeystlfpXasaqp 


5999 


1612 


981 


L>aaACt,IjGLMLTLEFGII,EFDPSWlGSWTQR/sWVSWRSRPGCE " 
LFSI WFGSI VNEGYLNSASEGEEFCI YNRNPNACS YGVAVGVL 
AFLrCLLYXALDVYFPQISSVKDRKK\AVLSGHPWSGEPHPAA 
FWAFLWPTGDSCYL\ANQWQVSKPKDNPLb7BGrDASPGRPSPFS 
FFS I FTWSLTAALAVRRFKDLSPQEE YSTLPP \ ASAQP 


^000 


2 


1790 


RPPMEKARRGGDGVPRGPVLHIVWGFHHKKGCQVEFSYPPLIP 
GDGHDSHTLPEEWKYLPFIiALPDGAHNYQEDTVFFHLPPRNGNG 
ATVFGrSCYR\QIEAKALKVRQADITRETVQKSVCVLSKLPLYG 
LLQAKLQLITHAyFEEKDPSQISIIlKELYEH^4NSSLGGASLEGS 
QVYLGLSPRDLVLHFRHKGLILPKLILLEKKVLPYISPVNKLVG 
ALMTVLSLFPGMrEHGLSDCSQYRPRKSMSEDGGLQESNPCADD 
FVSASTADVSHTNI^TIRKVMAGNHGEDAAMKTEEPLFQVEDSS 
KGQEPNDTNQYLKPPSRPSPDSSESDWETLDPSVLSDPNLKERE 
QLGSDQTNLFPKDSVPSESIiPITVQPQANTGQWLIPGLISGLB 
EDQYGMPIAIFTKGYLCLPYMALQQHHIiLSDVTVRGFVAGATNI 
LFRQQKHLSDAIVEVBEALIQIEIDPELRiajJNPTTADLRFADYL 
VRHVTENRDDVFiiDGTGWEGGDEWIRAQPAVYIHALIAATLQLV 
LFRIVNVAKKIGNVMVTT\SRNWQTGK\AVGQSVGGAFS\SAK 
TA\KSSWLSTFTTSTSQSt.TEPPDEKP 


6001 


101 


1561 


TEPCRTAENCTATMSENNKNSLESSLRQLKCHFTWNLMEGEUSL 
wA^f V r 1 « a r,r yiN KJiif jtft i M ^-DTbLAl IjKHLKGQNEAALECLR 
KAEELIQQEHADQASIRSLVTWGKTYAWVYYHMGRLSDVQIYVDK 
VKHVCEKFSSPYRI3SPELDCEEGWTRLKCGGNQNERAKVCFEK 
ALEKKPKNPEFTSGLAIASYRLDNWPPSQKAIDPLRQAIRLNPD 
NQYLKVLLALKLHKMREEGEEBGEGEK\LVBEflI,BKAPG\VrDV 
LRSAA\KFYRGKDEPDKAIEI,LKKALEYIP\NNAyijHCQIGCCY 
RAKVFQVMNLRBNGMYGKRKLLELIGHAVAHLKKADEANDNLFR 
VCSlLAStaALADQYEDAEYYFQKEFSKELTPVAKQLLHLRYGN 
FQLYQMKCEDKAIHHFIEGVKINQKSREKEKMKDKLQKIAKMRL 

SKNGADSEAUiVLAFLQELNEKMQQADEDSERGIiESGSLIPSAS 
SWNGE 




176 


103 8 

! 
i 


AFAkSPSRGElRHTHIHTPRHTPRCTMAESHLQSSLITASQPPEI 
^LHFDADGSCSYLEGKELQNLIQELQQARKKAGLELS PEMKTFVD 
2YGQRDDGKIGIVELAHVLPTEBNFLLLPRCQQLKSCE\EFMKT 
^RKYDTDHSGFIETEELKNFLKDLIjEKANKTVDDTKLAEYTDLM 
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5EQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino aciQ segment containing signal peptTai" 

(A=AlaninB, C^Cysteine, D=Aspartic Acid, Je« 

Glutamic Acid. F=sPhenvlal a^-i -nei r> r»i^,«j«^ 
' * •^**ci*yjioiiatiiiie^ «=t»XyCj.ne « 

H=Histidine, I=Isoleucine, K-L/sine, 

Ii=Leucine, M^^Methionine, N^Asparagine, 

P-Proline, Q=Glutamine, R-Arginine, 

S=Serine, T=Threonine, Vs=Valine, 

W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 

Codon, /^possible nucleotide deletion, 

\=possible nucleotide in«5fir-hi rmi 


eoo2 






LiU.FDSEmiJGKI.ELTEMARLLPVQENFLbKFUUXKMCGKEFNKA 
PELYDQIXJWGYIDENEIJDALLKDLCEKNKQDIIJINNITTYKKWI 
MALSDGGKLyRTDLALILCAGDN 


6003 


977 


81 


LAPPGGGLHIPPRTPLSHSRPPPSHHAPHPSPLPLPPADLHPHS 
SMAQRSDLLELDCQLTRDRWWSHDENLCRQSGLNRDVGSLDP 
EDLPLYKEKIiEVYFSPGHFAHGSDRRMVRLEDLFQRFPRTPMSV 
EIKGKNEELIREQ/VLVRRYDRITEITIWASEKSSVMKKCKAANP 
EMPLSFTISRGFWVLLSYYLGLLPFIPIPEKFFFCFLPNIIKRT 
YFPFSCSCLNQLIAWSKWLIMRKSLXRHLESRGVQWFWCLNE 
ESDFEAAFSVGATGVITDYPTALRHYLDNHGPAARTS 


6004 


140 


4098 


GKIiRAFRGMKRLICKRICDYKSFDDEESVDGNRPSSAASAFKVP 

APKTSGNPAWSARKPGSAGGPKVGAGASKEGGAGAVDEDDFIKA 

KlWPSIQIYSSRELEETLNKIRErLSDDKHDWDQRAMAI.KKIR 

SLLVAGAAQYDCFFQHLRLLDGALKLSAKDLRSQWREACITVA 

KLSTVLGNKFDHGAEAIVPTLFNLVPNSAKVMATSGCAAIRPI I 

RHTHVPRLIPLITSNCTSKSVPVRRRSFEFLDLLLQEWQTHSLE 

RHAAVr.VETIKKGIHDAI>AEARVEARKTYMGLRNHFPGEAETLY 

KSLEPSYQKSLQTYLKSSGSVASLPQSDRSSSSSQESLNRPPSS 

KWSTAJTPSTVAGRVSAGSSKASSLPGSLQRSRSDIDVNAAAGAK 

AHHAAGQSVRSGRUSAGALNAGSYASLEDTSDKLCXSTASEDGRV 

RAKLSAPLAGMGNAKADSRGRSRTKMVSQSQPGSRSGSPGRVLT 

TTALSTVSSGVQRVLVNSASAQKRSKIPRSQGCSREASPSRLSV 

ARSSRIPRPSVSQGCSREASRESSRDTSPVRSPQPLASRHHSRS 

TGALYAPEVYGASGPGYGISQSSRLSSSVSAMRVIiNTGSDVEEA 

VADALLLGDIRTKKKPARRRYESYGMHSDDDANSDASSACSBRS 

YSSRKaSIPTyMRQT\EDV\AEVI,NRC^SSfWSBRKEGLLGLaN 

LLKNQRTLSRVELKRLCEIPTRMFADPHGKRVFSMFUETLVDFI 

QVHKDDLQDWLFVLLTQLLKiCMGADLLGSVQAICVQKALDVTRES 

FPNDLQFWILMRFTVDQTQTPSLKVKVAILKYI ETLAKQMDPGD 

FINSSETRUVVSRVITWTTEPKSSDVRKAAQSVLISLFEUWTPE 

PTMLIGALPKTPQDGATKLLHNHLRNTGNGTQSSMGSPLTRPTP 

RSPANWSSPLTSPTNTSQKTLSPSAFDYDTENMNSEDIYSSL^G 

VTEAIQNPSFRSQEDMKEPLKRDSKKDDGDSMCGGPG\MSDPRA 

GGDATDSSQTAL\DNKASLLHSMPTHSSPRSRDYNPYNYSDS1S 

PFNKSALKEAMFDDDADQFPDDLSLDHSDLVAELLKRr,SNHNER 

VEERKIALYELMKIiTQEESFSVWDEHFKTILtLLLETLGDKE:>T 

IRALALKWJREILRHQPARFKNYAELTWKTLEAHKDPHKEVVR 

SAEEAASV\LATSI\SPEQCIK7LCPIIQTADYPIMIAAIKMQT 

KVIERVSKETLNLLLPEIMPGLIQGYDWSESSVRKACVFCLVAV 

HAVIGDELKPHLSQLTGSKMKLLNLYIKRAQTGSGGADPTTDVS 




140 

- 


4098 

\ 

I 
C 
£ 
F 


UKIJiAFRGMRRi>iciCRICDYKSFDDBBSVDGNRPSSAASAFKVP~ 

APKTSGNPANSARKPGSAGGPKVGAGASKEGGAGAVDEDDPIKA 

FTDVPSIQIYSSRELEETLNKlREILSDDKHDWDQRANAIiKKIR 

SLLVAGAAQYDCFFQHLRLLDGALKLSAKDLRSQWREACITVA 

HLSTVLGNKPDHGAEAIVPTI^PNLVPNSAKVMATSaCAAIRFII 

RHTHVPRLlPLITSNCTSKSVPVRRRSFEPLDLIiLQEWQTHSLE 

RHAAVLVETrKKGIHDADAEARVEARKTYMGLRKHFPGEAETLY 

MSLBPSYQKSLQTYLKSSGSVASLPQSDRSS5SSQESLNRPFSS 

KWSTAWPSrVAGRVSAGSSKASSLPGSLQRSRSDIDVNAAAGAK 

aHHAAGQSVRSGRLGAGALKAGSYASLEDTSDKXDGTASEDGRV 

RAKLSAPLAGMGMAKADSRGRSRTKMVSQSQPGSRSGSPGRVLT 

rTALSTVSSGVQRVLVNSASAQKRSKIPRSQGCSREASPSRLSV 

^RSSRIPRPSVSQGCSREASRESSRDTSPVRSFQPLASRHHSRS 

PGALYAPEVYGASQPGYGISQSSRLSSSVSAMRVLNTGSDVEEA 

/ABALLLQDIRTKKKPARRRYESYGMHSDDDANSDASSACSERS 

fSSRNGSIPTYMRQT\EDV\AJSVXKRCASSIIWSEHKEGLLGLQN 

iLKWQRTLSRVBLKRLCBIFTRMFADPHGKRVFSMPLETLVDPI 

fVHKI>DLQDMl,PVIXTQI,LKKMGADLLGSVQAKVQKALDVrRES 

TNDI^FNILMRPTVDOTQTPSLKVKVAILKYIETLAKQMDPGD 

'INSSErRLAVSRVITWTTEPKSSDVRKAAQSVLISLFEUTTPE 
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SEQ 
ID 
NO; 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D^spartic Acid, E=. 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H=Histidine, I=IsoleucinQ, K^Lysine, 
L=Leucine, M=Msthionine, K=Asparagine, 
pBproline, Q=Glutaraine, R=Arginine, 
SaSerine, r=Threonine, V=»Valine, 
W=Tryptophan, Yj=Tyxosine, X^Unknown, *aStop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








a I WJjiXaALii' K.i U ULKjArKLLHNHJjRNTGNGTQSSMGS PLTRPTP 
RSPANWSS PLTSPTNTSQNTLS PSAPDYDTENMITSED I YSSLRG 
VTEAIQNFS FRSQEDMNEPLKRDSKKDDGDSMOGGPG \MSDPRA 
GGDATDSSQTAL\DNKASLLHSMPTHSSPRSRDYNPYNYSDSIS 
PFNKSALKEAMPDDDADQFPDDLSLDHSDLVAEliLKELSNHKER 
VEERKXALYELMKLTQEESFSVWDEHFKTILIjLLLETLGDKEPT 

iraialkvlreilrhqparfknyaeltvmktleahkdphkewr 
saeeaasv\i*atsi\spe:qcikvlcpiiqtadypinlaaikmqt 
kvi ervs ketlnlllpeimpgli qg ydnsessvrkacvpclvav 
havigdelkphlsqltgs kmkllmlyi kraqtgsggad pttdvs 

GQS 


S005 


133 


5955 


RSSGRRQEQLGQFPGRERKGMASGLGS PS PCSAGSEEEDMDALL 
NNSLP PPHPBNEEDPEBDLSETETPKIiKKKKKPKKPRDPKI PKS 
KRQKKERMLLCRQLGDSSGEGPEFVEEEEEVALRSDSEGSDYTP 
GKKKKKKLGPKKBKKSKSKRKEEEEEDDDDDDDSKEPKSSAQLL 
BDWGMEDIDHVFSEEDYRTLTNYKAFSQPVRPLIAAKNPKIAVS 
KMMMVLGAKWR2FSTNNP FKGSSGAS VAAAAAAAVAWESM VTA 
TEVAPPPPPVEVPIRKAKTKEGKGPNARRKPKGSPRVPDAKKPK 
PKKVAPLK I KLGGFGS KRKRSSSEDDDLDVESDFDDASINSYS V 
SDGSTSRSSRSRKKLRTTKKKKKGEEEVTAVDGYETDHQDYCEV 
CQQGGEIILCDTCPRAYHMVCLDPDMEBCAPEGKWSCPHCEKEGI 
QWEAKEDNSEGEBILEEVGGDLEEEDDHHMEFCRVCKDGGELLC 
CDTCPSSYHIHCLNPPLPEIPKGEWLCPRCTCPAIjKGKVQKILI 
WKWGQPPSPTPVPRPPDADPNTPSPKPLEGRPERQFPVKWQGMS 
YWHCSWVSELQ1*SLHC\QVMFRNYQRKNDMDEPPSGDFGGDEEK 
S\RKRKNKDPKFAEMEBRFYRYQIKPEW\MMIHRII,NHSVDKKG 
HVHYLIKWRDLPYDQASWESEDVEIQDYDLFKQSYWNHRELMRG 
ESGRPGKKLKKVKIiRKIjERPPETPTVDPTVKYBRQPEYLDATGG 
TLHPYQMEGIiNWLRFSWAQGTDTILADEMGLGKTVQTAVFLYSL 
YKEGHSKGPFLVSAPLSTI IN\ WERE PEMWAPDM YV\VTYVGDK 
DSRAI IREKEFS \FEDNAIRGGKKASRMKKEASVKFHVLLTS YE 
IiITIDMAILGSIDWAGLIVDEAHRLKNNQSKPFRVLNGYSLQHK 
ZiLLTGTPLQNNLEELFHLLNFLTPERFHNLEGFLEEFADIAKED 
QIKKLHDMLG\PHMriRRLKADVPKNMP3KTEi:*IV\RVELSPM\Q 
KKYYK\YILHSKFIjK7U*N\ARGGGNQVSLLNVVMDLKKCCNHPY 
LFPVAAMEAPKMPNGMYDGSALIRASGKIiLLLQKMLKNLKEGGH 
RVL I FSQMTKMLDLLEDFLEHEG YKYERIDGGITGNMRQEAIDR 

fnapgaqqfcfllstragglgintatadtvi i ydsdwnphndi q 
afsrahrigqwkkvmiyrfvtrasveeritqvakkkmmlthlvv 
rpglgsktgsmskqelddilkfgteelfkdeatdgggdnkeged 

SSVIHYDDKAIERLLDRNQDETEDTELQGMNBYIiSSFKVAQYW 
REEEMGEEEEVBRE 1 1 KQEESVDPDYWBKLLRHHYEQQQEDLAR 
NLGKGKRIRKQVNYNDGSQBDRDWQDDQSDNQSDYSVASBEGDE 
DFDERSEAPRRPSRKGLRNDKDKPLPPLLARVGGNIEVLGFNAR 

qrkaflnaimrygmppqdafttqwi*vrdi»rgksekefecayvsi.f 
mrhlcbpgadgaetfadgvpreglsrqhvltrigvmslirkkvq 

E FEHVNGRWSMPELAE VEENKKMSQPGS PSPKTPTPSTPGDTQP 
NTPAPVP PAEDG I KIEENS LKEEBS I EGEKEVKSTAPETAI ECT 
QAPAPASEDEKVWEPPEGEBKVEKAEVKERTEEPMETEPKGKG 
AADVEKVEEKSAIDLTPIWEDKEEKKEEEBKKEVMLQNGETPK 

nT.MnTrTfriWMTTfrtPi?ivri7NrT3ir*r2r'tm?T.wcT Mn'Kri?T?DaR*nf'nrv'p 
JJ Dv* UCij\\i JUUX J, J\\^Kr ylCSS ±J\U\s \3C 1 JQ JjXlO LtW UIN iL xCAA 1 V 1 JVKT 

YEI WHRRHDYWLLAGI INHGYARWQD IQITOPRYAI LNSPFKGEM 
NRGNFIiEIKNKFLARRFKLLEQAIjVIEEQlaRRAAYLNMSEDPSH 
PSMALNTRFAEVECLAESHQHIiSKESMAGNKPANAVLHKVLKQL 
EELLSDMKADVTRLPATIARIPPVAVRLQMSERNILSRIiANRAP 
EPTPQQVAQQQ 


6006 


1 


965 


DNDFLRNT VHRHEPPVTAEP I RULJ^ENE DWWDKPSS I PVHPC 
GRFRHNTVIFILGKEHQLKELHPLHRLDRLTSGVLMFAKTAAVS 
ERIHEQVRDRQLEKEYVCRVEGEFPTEEVTCKEPILWSYKVGV 
CRVDPRGKPCETVFQRIiSYNGOSSWRCRPLTGRTHQrRVHLQF 
LGHPILNDPIYKISVAWGPSRGRGGYIPKTNBELI1RDI.VAEHQAK 
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SEQ 
ID 
NOt 


Predicteci 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signai peptide 
{AsAlanine, C^^Cyeteine, DsAspartic Acid, E= 
Glutamic Acid, F** Phenyl alanine, G^Glycine, 
H=Histidine, I^Isoleuclne, KsLysine, 
L= Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q^Glut amine, Rt=Arginine, 
S=Serine, TaThreonine, V»Valine, 
WsTryptophan, Y=ryro3ine, X=Unknoiim, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QSLDVUDI,CEGDIiSPGLTDSTAPSSELGKDDLEELAAAA\QKME 
EVAEAAPQEI>DriMiASEKAVETDVMNQ\RQT\TLCRVPAGATG 
SIiAPRPCDVPTCPTL 


6007 


3 


2351 


HELGQVE Y VFTDKTGTLTENEMQPRECS INGMKYQE INGRLVPE 
GPTPDSSBGNLSYLSSLSHLNNLSHLTTSSSFRTSPENBTELIK 
EHDLPPKAVSLCHT VQINNVQTDCTGDGPWQSNIiAPS QLE YYAS 
SPDEKALiVBAAARIGlVFIGNSEETMEVKTljGKLERYKLLHILE 
FDSDRRRMSVIVQAPSGEKLLFAKGAESSILPKCIGGEIEKTRI 
HVDEFALKGLRTLCIAYRKFT£KEYEEIDKRZFEARTALQQR\E 
E KIi AAV FQF 1 EKOL X LliG ATAVEDRLQDKVRET 1 BALRMAGl KV 
MVLTGDKHETAVS VSLS CGHFHRTMNILELINQKSDSE CAEQLR 
QLARRXTEDHVIOHGLWDGTSLSLALREHEKLPMEVCRNCSAV 
LCCRMAPLQKAKVIRLI KI SPEKPITLAVGDGANDVSMIQEAHV 
GIGIMGKEGRQAARNSDYAIARFKFLSKLLFVHGHFYY3RIATL 
VQYFFYKNVCFITPQFIiYQFYCLFSQQTLYDSVYLTLYNNICPT 
SLPI LI YSLLEQHVDPHVLQNKPTLYRDISKNRLLS IKTFLYWT 
ILGFSHAFIPPPGS YLLIGKDTS LLGKTGQMFGNWTFGTLVFT VM 
VITVTVKMALETHPMTWINHLVTWGS I IFYFVPSLPYGGILWPP 
LGSQNM YFVFIQIJjSSGSAWFAI IIiMWrCLFLDI IKKVFDRHL 
HPrSTEKAQLTETNAGIKCLDSMCCFPEGEAACASVGRMLERVI 
GRCSPTHI SRS WSASDPFYTNDRS ILTLSTMDSSTC 


■■ ^008 


45S4 


1089 


AGVRRAGARRGPGRAIjPAGATAVPPPSARRRRRCPAPEHAGPAR 

asrpsqetmfqiipvnnlgs lrkftrktvkkilsd igle yckehib 
dpkqfepndfylknttwedvgtiwdpsltknqdyrtkpfccsacp 
fsskffsaykshfrljvhsedfenrillncpyctfiradkktleth 
i ki fhapnasaps ssl5tfkdknkndglkpkqads ve qavyyck 
kctyrdplyeivrkhiyrehfqhvaapyiakageksi*ngavplg 
snarees s ihckrclfmpks yealvqhviedher iq yqvtamig 
htnvwprskplmliapkpqdkksmglpprigslasgnvVrslp 
sqqmvnrlsipkpnlnstgvnmmssvhiiqqnnygvksvgqgysv 

GQSMRLGLGGNAP VS I PQQSQSVKQLLPSGNGRS YGLGSEQRSQ 
APARYSLQSANASSLSSGQLKSPSLSQSQASRVLGQSSSKPAAA 
ATGPPPGNTSSTQKWKICTICNEIiFPENVYSVHFEKEHKABKVP 
AVANYIMKIHNPTS KCLYCNRYLPTDTLLNHML IHG LSCPYCRS 
TFNDVEKMAAHMRMVHIDBEMGPKTDSTLSFDLTLQQGSHTNIH 
LLVTTYNliRDAPAESVAYHAQNKPPVPPKPQPKVQEKADI PVKS 
S PQAAVP YKKDVGKTLCPLCFS ILKGP ISDALAHHLRERHQVI Q 
TVHPVEKKIiTYKCIHCLGVYTSNMTASTITLHLVHCRGVGKTQW 
GQDKIWAPSRLNQSPSLAPVKRTYEQMEFPLLKKRECLDDDSDSP 
S FFEEKPEEPWLALDPKGH\EDDSYEARKSPLTKYFT\ KQPYP 
TRREIEKLAASLWV\WK\SDIASHPSNKRKKCVRDCEBCYKPGVL 
LGFNMKELNKVKHEMDF^AEGLFENHDEKDSRVNASKTADKKIiN 
LGKEDDSSSDSPENIJSE3SNESGSPFDPVPEVBPKISNDNPEEH 
VLKVIPEDASBSEEKLDQKEDGSKYETIHLTEEPTKLMHNASDS 
EVDQDDWEWKDGASPSESGPGSQQVSDFEDNTCEMKPGTWSDE 
SSQSSDARSSKPAAKKKATMQGDREQLKWKNSSYGKVEGFWSKD 
QSQWKNASEKDERLSNPQI EWQNSTIDS EDGEQFDNMTDGVAE P 
MHGSLAGVKLSSQQA 


6009 


4272 


1534 


CHGLQHLTPFRELNLSLQG*EPH*AA*QAVRSEEKSIC*GSPSC 
HLVLGVLVP VARQSSHSAG PAQSAFR*TGTGSGTPKAAEQS GYW 
EAYTLGHQHWNMFPIQRPPLVMKGRRIMCGKCEKG*VSDSVTGG 
RAVAGEQASQRRTVFTAGGGECIiGAKSVRASVFTGNQPGVMGLL 
NGKRGGCFESGYLPGPIVIGKIQSLEAKVPLPWGQTGERASPG 
NCRIHI VDAVC* SEHH* DHFLAAAFLENSTI IS * VAPGS WQDHA 
VIiQKEVQASVRCRGFESVDTAPAGFWAHSPPGLQGEPTTTSVSL 
FVLAPQDGEGVPFVEGQLVTVLGLWPQS I RHTFVHHTQIiPLHP 
I * KLGALDVAFLHLLTLVCS S PKVAYG *GKNGGTTtiHQLFAEVN 
AVTRGSAVQRRPSITISSIHVDTKIQQELHDVMVAGADGWQWG 
DPFWGLAGI FHLIDDPtiHQ lELS FQRRV*EQCQGVKPDSQPVP 
RPXjRVGLLQVGPLVRGGGRRVAGRGKRCWRDLLFPWRWGLSHRT 
RDLLRGGDRGHVWIVLCRLGSLVGGLGTDELLWP6GR*IiIIIG 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

c or r e sponding 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=3 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
II=IIistidine, Islsoleucine, K=liysine, 
L-Leucine , M=Met hionine , N=JVsparagine , 
P=Proline, Q=Glutamine , R^rArginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknovni, *-Stop 
Codon, /=possible nucleotide deletion, 
\«possible nucleotide insertion) 








I * * RGRLSGEWGCGLGRGELFQVSIGIGVS I VHIGQGDHEVIiGG 
AGLVERGALHATGQGVEALVQQLLDVGPAGALGLCDGAALFCX3P 
GRVGQLPAEGLQVCITLVAQWRMHDGRELGGAEWPWQALHGAAI 
CGVGGAILLKALSQYFLKGG*RLWCARGQ * P VKKRQRRWRG* TR 
R *NGLTIHCFN* LI * GAVCCRLVILRWCGLLEVHGVYGT* IHCL 
GSFPGRLWP*PPISQERPNGHCQWEFRLAVPSWKCRWSRWRVRG 
TWRyGNPLLNLL*GAWLGGAACGGQQGGPLSTWQACTGPaQAAP 
LPPFQGACRPRTQRCRTWCPIAWRQLLAYTRD 


60X0 


1 


3533 


IMPCGSSRLLRGCWTHPNEPVSDLSYFDCIESVMENSKVLGESM 
AGISQNAKTGDLPAFGECVGIASKALCGLTEAAAQAAYLVGIFD 
PNSQAGHQGLVDPIQFARANQAIQMACQNLVDPGSSPSQVLSAA 
TIVAKHTSALCNACRIASSKTANPVAKRHFVQSAKEVANSTANL 
VKTI KALDGDFS EDNRNKCRI ATAPL I EAVENLTAFASNPEF VS 
IPAQISSEGSQAQEPILVSAKPMLESSSYI.IRTARSLAINPKDP 
PTMS VLAGHSH? VS DS I KSLI TS I RDKAPGQRECDYS IDG INRC 
IRDIEQASLAAVSQSLATRDDISVEALQEQLTSWQEIGHIjIDP 
lATAARGEAAQLGHKGTQLASYFEPLILAAVGVASKILDHQQQM 

tvldqtktlaesalqmlyaakegggnpkaqhthdaitsaaqlmk 
eavdd imvti.neaasevglvggmvdai aeams kiidegtppe pkg 
tfvdyqttwkyskaiavtaqemmtksvtnpbelgglasqmtsd 
yghlafqgqmaaataepeeigfqirtrvqdlghgciflvqkagX 
alqvcptdsytkreliecaravtekvslvlsalqagnkgtqaci 

TAATAVSGIIADLDTTIMFATAGTIjNAENSETFADHRENILKTA 
KALVEDTKLLVSGAAST PDKLAQAAQS S AAT I TQLAEWKLGAA 
SLGSDDPETQWLINAl KDVAKALSDL ISATKGAASKPVDDPSM 
YQLKGAAKVMVTNVTSLLKTVKAVEDKATRGTRALBATIECIKQ 
ELTVFQSKDVPEKTSSPBESIRMTKGITMATAKAVAAGNSCRQE 
DVIATANLSRKAVSDMLTACKQASFHPDVSDEVRTRALRFGTEC 

tlgyldllehvlvii.qkptpei1kqqlaafskrvagavteliqaa 
kamkgtewvdpbdptviastellgaaas i eaaakkiieqlkprak 
pkqadetldfeeqilbaaksiaaatsalvksasaaqrelvaqgk 
vgsipanaaddgqwsqgli saarmvaaatsslceaanasvqgha 
seekiiissakqvaastaqi.i,vackvkadqdseat1rrlqaagwav 

KRASDNLVRAAQKAAFGKADDDD WVKTKFVGG I AQI lAAQEEM 
LKKERELEEARKKLAQIRQQQYKFIiPTELREDEG 


6011 


446 


1835 


LLQPAMRKSPGLSDCLWAWILIjLSTLTGRSYGQPSLQDELKDNT 
TVFTRILDRLLDGYDNRLRPGLGERVTEVKTDIFVTSPGPVSDH 
DMEYTIDVFFRQBWKDBRLKPKGPMTVLRIiNNLMASKrWTPDTF 
FHNGKKS VAHNMTMPNKLLRI T3DGTIiLYTMRLTVR\AECPMAF 
GRDFPM\D\AHACPLKFGSYAYTRAEVVYEWTREPARSVWAED 
GSRLNQYDIiLGQTVDSGlVQSSTGEYWMTTHFHLKRKIGYFVI 
QTYLPCIMTVI LSQVSFWLNRESVPARTVFGVTTVLTMTTLS IS 
ARNSLPKVAYATAMDWPXAVCYAPVFSAIiIEFATVNYFTKRGYA 
WDGKS WPEKPKKVKDPLI KKNNTYAPTATSYTPNLARGDPGLA 
TIAKSATIEPKEVKPSTKPPEPKKTFNSVSKIDRLSRIAFPLLF 
GIPNLVYMATYLNRE PQLKAPTPHQ 


€012 


351 


S013 


PAELFQSFAIWHKELYDWRLG P WNQCQPVI skslekpleci KGE 
EGIQVREIACIQKDKD IPAED 1 1 CEYFEPKPL bEQACLI PCQQD 
CIVSEFSAWSECSKTCGSGLQHRTRHWAPPQFGGSGCPNLTEF 
QVCQSSPCEAEELRYSLHVGPWSTCSMPHSRQVRQARRRGKNKE 
REKDRSKGVKDPEARELIKKKRI^RNRQNRQENKYWD I QIGYQTR 
EVMCINKTGKAADLSFCQQEKLPMTFQ3CVITKECQVSEWSEWS 
PCSKTCHDMVSPAGTRVRTRTIRQFPIGSEKECPEFEEKEPCIiS 

qgdgwpcatygwrttewtecrvdpllsqqdkrrgnqtalcggg 

IQTRE VYCVQANBNLLSQLSTHKNKEASKPMDLKLCTGP I PNTT 

qlchipcptecbvspwsawgpctyencndqqgkkgfklrkrrit 
neptggsgvtgncphll3aipceepacydwkavrlgdcepdngk 
ecgpgtqvqewcinsdgeevdrqlcrdai fpi pvacdapcpkd 
cvlstwstwsscshtcsgkttegkqirarsiiayageeggircp 
nssalqevrscntehpctvyhwqtgpwgqciedtsvssfnttttw 
ngeascsvgmqtrkvicvrvnvgqvgpkkcpeslrpbtvrpcll | 
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SEQ 
ID 
NO: 


PrecSicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
secpience 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D«Aspartic Acid, £=» 
Glutamic Acid, F«* Phenyl alanine, G=Glycine, 
H»Hi3tidine, I=Isoleucine, K=Lysxae, 
LsLeucine , M=Methionine , N=Asparagine , 
PaProline, Q=<3lutamine, Rz^Arginine, 
SsSerine , T=Threonine , V=Valine , 
W=Tryptophan* Y=Tyrosine, XssUnknoMn, *:=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PCKKDCIVTPYSDWTSCPSXSCKEGDSSIRKQSRHRVIIQLPAN 
GGRDCTDPLYEEKACEAPQACQSYRW\KTHKW\HRCQ\LVP\WS 
VQQDS P \GAQEGCGPGRQARAI TCRKQDGGQAGI HECLQ YAGPV 
PAIiTQACQIPCQDDCQLTSWSKFSSCNGDCGAVRTRKRTIiVGKS 
KKKEKCKNSHLYPLIETQYCPCDKYNAQPVGNWSDCILPEGKVE 
VLLGMKVQGDIKECGQGYRYCaAMACYDQNGRLVETSRCNSHGYI 
EEACIIPCPSDCKLSEWSNWSRCSKSCGSGVKVRSKWLREKPYN 
GGRPCPKLDHVNQAQVYEWPCHSDCNQYLWVTEPWS I CKVTPV 
NMRENCGEGVQTRKVRCMQNTADGPSEHVEDYLCDPEEMPLGSR 
VCKLPCPEDCVISEWGPWTQCVLPCNQSSFRQRSJa>PIRQPADE 
GRSCPNAVEKEPCNLNKMCYHYDYWVTDWSTCQLSEKAVCGNGI 
KTRMLDCVRSDGKSVDLKYCEALGLEKNWQMNTSCMVECPVNCQ 
LSDWSPWSECSQTCGLT6KMIRRRTVTQPFQGDGRPCPSLMDQS 
KPCPVKPCYRWQYGQWSPCQVQEAQCGEGTRTRNISCWSDGSA 
EDFSKWDEEFCADIBLIIDGNKNMVIiEESCSQPCPGDCYLKDW 
SSWSLCQLTCVNGEDLGFGGIQVRSRPVlIQELENQKLCPEQMIi 
ETKSCYDGQCYEYKWMASAWKGSSRTVWCQRSDGINVTGGCLVM 
SQPnADRSCNPPCSQPHSYCSETKTCHCEEGYTEVMSSWSTLEQ 
CTI»IPWVLPTMEDKRGDVKTSRAVHPTQPSSNPAGRGRTWFLQ 
PFGPDGRLKTWVYGVAAGAFVLLIFIVSMIYLACKKPKKPQRRQ 
NNRLKPLTLAYDGDADM 


6013 


1161 


710 


GAFIAGVPVQPVLIRYPNSUDTTSWAWRGPGVLKVLWLTASQPC 
SIVDVBFLPVYHPSPEESRDPTLYANNVQRVMAQALGIPATECE 
FVGSLPVrWGRLKVALEPQL/WGTGKSASEGWAVRKLCGRWGR 
ARPESNDQPGRVCQAATAL 


5014 


2857 


613 


EAVAGGMEKSRMNLPKGPDTLCFDKDSFMKEDFDVDHFVSUCRK 
RVQLEELRDDLELYYKLLKTAMVELINKDYADFWNLSTNLVGM 
DKALITQLSVPLGQLREBVLSLRSSVSEGIRAVDERMSKQEDIRK 
KKMCVLRLIQVIRSVEKIEKILNSQSSKETSALEASSPLLTGQI 
LERIATEFKQLQFHACQSK\GMPLLDKVRPRIAGITAMLQQSbE 
GLLLEGLQTSDVDIIRHCLRTYATIDKTRDAEALVGQVLVKPYI 
DEVIIEQFVESHPNGLQVMYNKLIiEFVPHHCRLLREVTGGAISS 
EKGNTVPGYDFLVNSVWPQIVQGLEEKLPSLFNPGNPDAFHSKY 
TISMDFVRRLERQCGSOASVKRLRAHPAYHSFNKKWNLPVYFQI 
RFREIAGSLEAALTDVLBDAPAESPYCLLASHRTWSSLRRCWSD 
EMFLPLLVHRLWRLHSGRFWARYSVFV\N\ELSriRPISNESPKE 
IKKPLVTGSKEPS ITQGNTEDQGSGPSETKP WS ISRTQijVyW 
ADLDKLQEQLPELLEIIKPKLBMIGPKNPSSISAALEDSQSSPS 
ACVPSLSSKlIQDLSDSCFGFIiKSALEVPRLYRRTNKEVPTTAS 
SYVDSALKPLFQLQSGHKDKLKQAIIQQWLEGTIjSESTHKYYET 
VSDVLNSVKKMEESLKRIiKQARKTTPANPVGPSGGMSDDDKIRL 
QLALDVEYLGEQ IQKLGLQASDIKSFSALAELVAAAKDQATAEQ 
P 


6015 

> 


13 


2237 


AEGCAERRGTEPWELSMSWSSGAGPGLGSQGMDLVWSAWYGKC 
VKGKGSX.PLSAHGIWAWLSRAEWDOVTVYLPCDrHKLQRYAIzN 
RITVtrRSRSGNELPLAVASTADLIRCKLLDVTGOLGTDEIiRLLY 
GMALVRFVNLISERKTKFAKVPLKCLAQEVNIPDWIVDLRHEIiT 
HKKMPHINDCRRGCYFVLDWLQKTYWCRQLENSLRETWELEEFR 
EGIEEEDQEEDKNIWDDITEQKPBPQDDGKSTESDVKADGDSK 
GSEEVDSHCKKALSHKELYERARELLVSYEEEQFTVLEKFRYLP 
KAIKAWNNPSPRVECVIAEIiKGVTCEMREAVLDAFLDDGPLVPT 
PEQIiAALQIEYEENVDLNDVLVPKPFSQFWQPLLRGLHSQNFTQ 
ALLERMLSELPALGISGIRPTYILRWTVELIVANTKTGRNARRF 
SAGQWEARRGWRLFNCSASLDWPRMVESCLGSPCWASPQLLRXI 
F\KAMGQGLQDB\EQEKLLRICSIYTQSGENSLVQEGSEASPIG 
KSPYTLDSLYWSVKPASSSFGSEAKAQQQEEQGSVNDVKEEEKE 
EKEVLPDQVEEEEENDDQEEEEEDEDDEDDEEEDRMEVGPFSTG 
QESPTAENARLLAQXRGALQGSAWQVSSEDVRWDTFP\LGRMPR 
SRPRTPAELMIiENYDTHVIFWTKPVL\EQRLEPSTCK\TDTLGL 
\SCGVGS\GKCSNSSSSNFRGAFLLEARGSLH\G1,\KTGLQLF 


6016. 


13 


2237 


ASGCAERRGTEPWEIiSMSWESGAGPGLGSQGMDLVWSAWYGKC | 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D^Aspartic Acid, E« 
Glutamic Acid, PssPhenylalanine , G=Glycine, 
HsHistidine, I»Isoleucine, K= Lysine, 
L=I*eucine , Mt=Methionine , N^Asparagine , 
P=Proline, Q=Glut amine, R=Arginine, 
S»Serine, T~Threonine, Vt=Valine, 
W-Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion^ 
\=possible nucleotide insertion) 








VKGKGSLPIiSAHGIWAWLSRAEWDQVTVYLPCDDHKIiQRYALN 
R ITVWRS RSGNELPLAVASTADL IRCKLLDVTGGLGTDELRIiLY 
GMALVRPVNLI SERKTKPAKVPLKCIiAQEVNIPDW X VDIiRHELT 
HKKMPKINDCRRGCYFVLDWLQKTYWCRQLENSLRETWELEEFR 
EGI EEEDQEEDKNI WDDITEQKPEPQDDGKSTESDVKADGDS K 
GSEEVDSHCKKALSHKELYBRARELLVSYEEEQFTVLEKFRYLP 
KAI KAKNNPS PRVEC VLAELKGVTCENRBAVLDAFLDDGPLVPT 
FBQLAALQIEYBENVDLNDVLVPKPFSQFWQPLLRGLHSQNFTQ 
ALLERMLSELPALGISGIRPTYILRWTVELIVANTKTGRNARRF 
S AGQWEARRGWRLFNCS ASLDWPRMVESCLGS PCWAS PQLLRI I 
F\KAMGQQLQDE\EQBKLLRI CS I YTQSGENSLVQEGSEAS PIG 
KSPYTLDSLYWSVKPASSSFGSBAKAQQQEEQGSVNDVKEEEKE 
EKEVLPDQVEEEEENDDQEEEEEDEDDBDDEEKDRMEVGPFSTG 
QES PTAENARLLAQKRGAXiQGSAWQVSSEDVRWDT FP\liGRMPR 
SRPRTPAELMLENYDTHVIFWTKPVL\EQRLEPSTCK\TDTLGL 
\SCGVGS\GNCSNSSSSNFRGAFLLEARGSLH\GL\KTGLQLF 


6017 


203 


34^5 


SHQE lEQNSAMAPRKRGGRGIS F I FCCFRNNDHPE I TYRLRNDS 
NPALQTMBPALPMPPVEEIiDVMFSELVDELDLTDKHREAMFALP 
ABKKWQIYCSKKKDQEENKGATSWPSFYIDQLNSMAARKSLIAL 
EKBEEBERSKT I ESLKTALRTKPMRFVTRPl DIjDGliSC I LNFLK 
TMDYETSESRIHXSIil GCI KA1.MNNSQGRAHVLAHSES INVI AQ 
SLSTENIKTKVAVLEI LGAVCLVPGGHKKVLQAMLHYQKYASER 
TRFQTLINDLDKSTGRYRDEVS LKTAIMSFINAVLSQGAGVESL 
DFRLHLRYHN FLMLG IHP VMDKLRKHENSTLDRHLDFFEMLRNE 
DBLEFAKRPELVHIDTKSATC^PELTRKRLTHSEAYPHFMSILK 
HCLQWP YKRSGNTVQYWLIiLDR 1 2QQI VIQNDKGQDPDSTPLEW 
FKIKNVVRMLVNENEVKQWKEQAEKMRKEMimiQQKLBKKEREC 
DAKTQEKEEMMQTLNKMKEKtiEKETTEHKQVKQQVAELTAQliHE 
LSRRAVCAS I PGGPSPGAPGGPFPSSVPGSLLPP PPPPPLPGC3M 
IiPP P PPPLPPGGPPPPPGPPPIiGAIMP PPGAPMGIALKKKS I PQ 
PTNALKS FNWS KLPENKLEGTVWTEIDDTKVPKILDLEDLERTF 
SAYQRQQDFFVWSNSKQKEADAIDDTLSSKLKVKELSVIDGRRA 
QNCNILLSRLKLSNDEIKRAXLTMDEQSDLPKDMLEQLLKPVPE 
KSDIDLLEEHKHELDRMAKADRFLFEMSRINHYQQRDQSLYFKK 
KPAERVAEVKP KVEAIRSGSEEVFRS GALKOLDEWIAFGNYMNT 
KGQRGNAYGFKI SSLNKIADTKSS I DKNI TLLHYLI TI VENKYP 
S VLNLNEELRD I PQAAKVNMTELDKE I STLRSGLKAVETELE YQ 
KSQPPQPGDKFVSWSQPI TVASFS FSDVEDLIAEAKDLFTKAV 
KHFGEEAGKIQPDEFFGIFDQPLQAVSEAKQEKENMRKKKEEEE 
RRARMEAQLKEQRERERKMRKAKENSEESGBFDDLVSALRSOEV 
FDKDLSKLKRNRKRITNQMTDSSRERPITKLNF 


eols 


13 


2510 


T ISQSGG I RRRREAVWFE WNMDFSRLHMYS PPQCVPENTGYTY 
ALSSSYSSDALDFETEHKIjDPVFDSPRMSRRSLRIiATTACTLGD 
GEAVGADSGTSSAVSIiKNRAARTTKQRRSTNKSAFSINHVSRQV 
TSSGVSYGGTVSLQDAVTRRP PVLDES wi REQTTVDHFWGLDDD 

gdlkggnkaaiqgngdvgagaatghngfpcsncnmls3rkdvlt 
ahpaapgpvsrvysrdrnqkcddckgkrhldahpgragtlwhiw 
acagyfllqilrrigavgqavsrtawsalwlawapgkaasgvf 

WWLGIGWYQFVTLISWLNVFLLTRCLRNICKFIiVLLIPLFLLLG 
LSLRGQGXNPFSPIiPVLNWASMHRTQRVDDPQDVFKPTTSRLKQ 

plqgdseafpwhwmsgveqqvaslsgqchhhgenlreiittllqk 
lqarvdqmeggaagpsasvrdavgqppretdfmafhqehevrms 
hlbdilgklrekseaiqkelsqtkqktisavgeqllptvehlql 
eldqlkselsswrhvktgcetvdavqervdvqvremvkllfsed 
qqggslbqllqrpssqpvskgdlqtmlrdlqlqilrnvthhvsv 
tkqbptseawsavseagasgltbaqaraivnsalkiiysqdktg 

MVDFALBSGGGSILSTRCSBTYETKTALMSLFGIPXiWYFSQSPR 
WrQPDIYPGNCWAFKGSQGYLWRLSMMIHPAAFTLEHIPKTL 
SPTGNISSAPKDPAVYGLENEYQEEGQLLGQFTYDQDGESUSMF 
QALKRPDDTAFQIVELRIPSNWGHPEYTCLYRPRVHGBPVK 


6019 


2 


1066 


TPNDREPPPQRPPSSRRASHLAQEI TSTtfVSLGDQXQ I LGSLTTA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine^ C=Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, F~Phenylalanine , G=Glycine^ 
Hi^^Histidine , lelsoleucine, K^Lysine, 
LsLeucine, M^Methionine, N=Asparagine, 
P=Proline, Q=Glutatnine, R=Arginine, 
S=Serine, T^* Threonine, V=Valine, 
w- Tryptophan, Y^Tyrosine, Xs^Unknown, *-Stop 
Codon, /=3poasible nucleotide deletion, 
\=possible nucleotide insertion) 








PVITSA IRSMPG IS SQILTNAQGQVIGTLPWWNSASVAAPAPA 
QSIjQVQAVTPQLLLNAQGQVIATLASSPIiPPPVAVRK\PSTPES 
LLKSBVQPIKPTPTVPQPAWIASPAPAAKPSASAPrPITCSKT 
PTVSQLVSKPHTPSLDEDGINLEEIRSFAKNFKIRRLSLGLTQT 
QVGQALTATEXSPAYSQSAICRFEFOiDITPKSAQKLKPVLEKWLN 
KAELRNQEGQQNLM BFVGGEPSKKRKRRTS FTPQAIEALNAYPE 
KNPLPTGQE ITE I AKELNYDREVVRVWFC35RRQTLKNTS KLNVF 
QXP 


6020 


4953 


549 


EAIQFBVSIGNYGNKFDTTCKPLASTTQYSRAVFDGNYYYYIjPW 
AHTKPWTLTSYWEDI SHRLDAVNTLIiAMAERLQTNIEALKSG I 
QGKIPANQLAELWLKLIDEVIEDTRYTLPI.TBGKANVTVIiDTQI 
RKLRSRSLSQ IHEAAVRMRSKATD VKSTLAE I SDWLDKLMQLTE 
EPQNSy.PD III WMIRGEKRLAYAR I PAHQVLYSTSGENASGKY C 
GKTQTIFLKYPQEKNWGPKVPVELRVN3WLGLSAVEKKFNSFAE 
GTFTVFAEMYENQALMFGKWGTSGLVGRHKFSDVTGKIKLKREP 
FLPPKGWEWEGEWIVDPERSLLTEADAGHTEPTDEVYQNESRYP 
GGDWKPAEDTYTDANGDKAAS PSELTCPPGWEWEDDAWS YDINR 
AVDEKGWEYGITIPPDHKPKSWVAAEKMYHTHRRRRLVRKRKKD 
LTQTASSTAGAMEEIiQDQEGWEYASLIGWKPHNKQRSSDTFRRR 
RWRRKMAPSETHGAAAI FKIjEGALGADTTEDGDEKSLBKQKHSA 
TTVFGANTP I VSdCFDRDY I YHLRCYVyQARNLLALDKDS FSDP 
yAHICPLHRSKTTEIIHSTLNPTWDQTIIFDEVEIYGBPQTVLQ 
NPPKVIMELPDNDQVGiCDEFLGRS I PSPWKLNSEMDITPKLLW 
HPVMNGDKACGDVLVTABLILRGKDGSNLP I LPPQRAPNLYMVP 
QGI RPWQLTAIEILAWGLRNMKNFQMASI TSPS LVVBCGGBRV 
ESWIKNLKKTPNFPSSVLFMKVFLPKEELYMPPLVIKVIDHRQ 
FGRKPWGQCTIERLDRFRCDPYAGKEDIVPQLKASLLSAPPCR 
DIVIEMEDTKPLIASKCLSSMSTALSKMASPATVHLTEKEEEIV 
DWWS KFYASSGEHEKCGQ YIQKGYS KLKTYNCELENVAE PEGLT 
DFSDTFKLYRGKSDENEDPSWGEFKGSFRIYPLPDDPSVPAPP 
RQFREIf PDSVPQECTVRI YIVRGLELQPQDNNGLCDPy I KITLG 

kkvieNdrdhyipntlnpvfgrmyelscylpqekdlkisvydyd 
tftrdekvgbti xdlenpp\lsrfq\shcg\ ipeeycvsgvntw 
rdslrVptqSllqnvarfkgppqpilsedgsriryggrdyslde 

FEANKIIiHQHLGAPEERIiALHILRTaGIiVPEHVETRTrjJSTFQP 

nis\ryylrvi itvntkdvildeks itgeemsdi yvkgwi pgnee 
nkqktdvhyrsldgegnfnwrfvppfdylpaeqlcivtikkehfw 
sidqtefrippr\liiqiw\dndkfs\lddylgpprtt>tcrhti 
hplqks pggnc /rgldmi pdlkt^mnplkaktaslfbqksmkqiw 
pcyaekdgarvmagkvemtleilnekeadbrpagkgrdepnmnp 
kidlpnrpetsflwftnpcktmkfi wrrfkwvl igllfliil ll 

LFVAVLLYSLPNYIiSMKIVKPNTV 


6021 


4953 


549 


EAIQFEVSIGNYGNKFDTTCKPLT^TTQYSRAVFDGNYYYYLPW 
AHTKP WTLTSYWEDI SHRIiDAVNTLLAMAERLQTNIEALKSG I 
QGKIPAKQLAELWLKLIDEVIEDTRYTLPLTEGKANVTVLDTQI 
RKLRSRSLSQIHEAAVRMRSEATDVKSTIiAEIEDWLDKLMQLTE 
EPQNSMPDIIIWMIRGEKRLAYARIPAHQVLySTSQENASGKYC 
GKTQTI FLKYPQEKNNGPKVPVELRVNI WLGLSAVEKKFNSFAE 
GTFTVFAEMYEKQAIjMFGKWGTSGLVGRHKFSDVTGKIK]UKREF 
FLPPKGWEWEGEWIVDPERSLLTEADAGHTEFTDEVYQKTESRYP 

ggdwkpaedtytdangdkaaspseltcppgwemeddawsydinr 
avdekgweygitippdhkpkswvaaekmyhthrrrrlvrkrkkd 
ltqtasstagameelqdqegweyasligwkfhwkqrssdtprrr 

RWRRKMAPS ETHGAAAI FKLEGJMjGADTTEDGDEKSLEKQKHSA 
TTVPGANTPr VSGKFDRDY I YHLRCYVYOARNLIALDKDSFSDP 
YAHI CFLHRSKTTE I IHSTLNPTWDQTI IFDEVE I YGEPQTVLQ 
NPPKVIMELFDNDQVGKDEFJU3RS I FSPWKLNSEMD I TPKLLW 
HPVMNGDKACGDVLVTAELILRGKDGSNLP I LPPQRAPNLYMVP 
QG IRPWQLTAIE ILAWGLRNMKNPQMAS ITSPSLWECGGERV 
ESWIKNLKKTPNFPSSVLFMKVFLPKEBLYMPPI^VIKVIDHRQ 
FGRKPWGQCTIERUDRFRCDPYAGKEDIVPQLKASLLSAPPCR 
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SEQ~ 
ID 
NO: 


Predicted ~~ 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corr e spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepti"ae~ 
(A=AZanine, C=Cysteine, D=Aspartic Acid, E=s 
Glutamic Acid, F=Phenylaaanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
Lt=Leucine, M=Methionine, N=Asparagine, 
P=Proline, a=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V«Valine, 
^"Tryptophan, Y«Tyrosine, X=Unknown, *=3top 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 


6022 






DIVIEMEDTKPDLASKCLSSMSTAIiSKMASPATVHLTEKEEEIV 
DWWSKFYASSGEHEKCGQyiQKGYSKLKlYlTCELENVAE^GLT 
DFSDTFKLYRGKSDENEDPSWGEFKGSFRIYPLPDDPSVPAPP 
RQFRELPDSVPQECTVRIYIVRGLELQPQDHNGLCDPYIKITLG 

kkvieXdrdhyipntlwpvfgrmyelscylpqekdlkisvydyd 

TFTRDEKVaETIIDI,ERPF\LSRPG\SHCG\lPEEYCVSGVNTW 
RDSLR\PTQ\LLQNVARFKGPPQPILSEDGSRIRYGGRDYSLDE 

peankilhqhlgapeerlalhilrtqglvpehvetrtlhstfqp 
kisXryylrviiwntkdvildeksitgeemsdiyvkgwipgneb 

NKQKTDVHYRSr»DGEGWFNWRFVPPPDVLPAEQLCIVAKKEHPW 
SIDQTEFRlPPR\LIIQIW\DNDKFS\LriDYLGPPRTi;,TCRHTI 
HFLQKSPGGNC/RGLDMIPDLKAMNPIiKAKTASLFEQKSMKGWW 
PCYAEKDGARVMAGKVEHTLEILNEKSADBRPAGKGRDEPNMNP 
KLDLPNRPBTSFLWPTNPCKTMKPIVWRRFCTTVIIGLLFLIiILL 
LFVAVLLYSIiPNYLSMKIVKPKV 


6023 


4953 


S4S 
916 ' 


EAIQFEVSJ.CiMYGNKFDTTCKPXASTTqYSRAVFDGMYYYYLPW'" 

ahtkpwtltsywedishrldavntllamaerlqtmiealksgi 

QGKIPANQIiAELWLKLIDEVIEDTRYTLPLTEGKA!WTVLDTQI 
RKLRSRSLSQIHEAAVRMRSEATDVKSTLAEIEDWLDKL^3QLTE 
EPQNSMPDI I IWMIRGEKRIAYARIPAHQVIiYSTSGENASGKYC 
GKTQTIFI.KYPQEKNNGPKVPVELRVNIMLGLSAVEKKFNSFAE 

gtftvfaemyenqalmfgkwgtsglvgrhkfsdvtgkiklkrsf 
elppkgwewbgewivdpersllteadaghteftdevyqnbsryp 
ggdwkpaedtytdangdkaaspseltcppgweweddawsydinr 
avdekgweygitippdhkpkswvaaekmyhthrrrrlvrkrkkd 
ltqtasstagameelqdqegweyasligwkfhwkqrssdtfrrr 
rwrrkmapsethgaaaipklegalgadttbdgdekslbkqkhsa 

TTVFGANTP I VSCKPDRDYI YHLRCYVYQARNLLALDKDSFSDP 

yahicflhrsktteiikstlnptwdqtiifdeveiygepqtvlq 

NPPKVXMELFDNDQVGKDEFLGRSIFSPWKLNSEMDITPKIiLW 

hpvmngdkacgdvlvtaelilrgkdgswlpilppqrapnlymvp 

QGIRPWQLTAIEILAWGLRWMKNFQMASITSPSLWECGGERV 

ESWIKKTLKKTPNFPSSVLFMKVFLPKEELYMPPLVIKVIDHRQ 

I'-GRKPWGQCTIERLDRFRCDPYAGKEDIVPQLKASLLSAPPCR 

DIVlEMEDTKPIiIASKCLSSMSTALSKMASPATVHLTEKEBEIV 

DWWSKPYASSGBHEKCGQYIQKGYSKLKIYWCELENVAEFEGLT 

DFSDTPKLYRGKSDENEDPSWGEFKGSFRIYPLPDD^SVPAPP 

RQPRELPDSVPQECTVRIYIVRGLELQPQDNKGLCDPYIKITLG 

KKVIE\DRDPIYIPNTIiNPVFGRMYELSCYLPQEKDLKrSVYDYD 

TFTRDEKVGETIIDLENPF\LSRFG\SHCG\IPEEYCVSG\7NTW 

RDSUR\PTQ\LLQNVARFKGFPQPILSEDGSRrRYGGRDYSLDE 

FEANKILHQHLGAPEERLALHILR1X3GLVPEHVETRTLHSTFQP 

NIS\RYYLRVX1WNTKDVILDEKSITGEEMSDIYVKGWIPGNEE 

NKQKTDVHYRSLDGEGNENWRPVFPFDYLPAEQLCIVAKKEHFH 

SIDQTEFRIPPR\LIIQIW\DNDKFS\LDDYL6FPRTLTCRHTI 

HFLQKS PGGNC/RGIiDMIPDLKAMNPLKAKTAS LFEQKSMKGW W 

PCYABKDGARVMAGKVEMTLEILNEKEADERPAGKGRDEPNMNP 

KLDLPNRPETSFLWFTNPCKTMKFIVWRRFKWVIIGLLFLLILL 
LFVAVLLYSLPNYLSMKIVKPNV 




6024 


102 




y ybU5MFVEI*NNLLNTtPDRAEQGKLTLLCDAiCTDGS FLVHHPL 

S FYLKANCKVCFVALIQSPSHYS rVGQKLGVSLTMARERGQLVF 

LEGL/lVCSGR\VPQAQKEPHPLQPLREArfAGNLKPLFEFVREA 

LKPVDSGEARWTYPVLLVDDLSVIiLSIjGMGAVAVIiDFIHYCRAT 

VCWELKGNMWLVHDSGDAEDEENDILLNGLSHQSHLILRABGL 

tVTGFCRDVHGQLRILWRRPSQPAVHRDQSFTYQYKIQDKSVSFF 
!^KGMSPAVL 




3 


3260 ] 
] 
i 
3 
1 


^LSFLCYPRPRc:i,FCLQFAIPASRMEQLNELBLU^KSPWEEAB 
^PAELFQKKWASFPRTVLSTGMDNRYLVLAVNTVQNKEGNCEK 

?lvitasqslbnkelcxi»rndwcsvpvepgdiihlegdctsdtw 

CIDKDFGYLILYPDfO^ISGTSIASSIRCMRRAVLSETFRSSDPA 
}RQMLIGTVLHEVFQKAINNSFAPBia.QEIJ^ 
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SEQ 
ID 
NO: 


Predlcced 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment: containing signal pepti<ie 
{A«Alanine. C=Cy3teine, D=Aspartic Acid, E=: 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
HoIIistidine, I=l3oleucine, K^Lysine. 
L«Leucine, M=Methionine, N=Asparagine, 
P^Prollne, Q=Glutamine, R=Arginine, 
S=iSerine, T:=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X-0nknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








YRLNLSQDJBIKQEVEDYltPSFCKWAGDFMHKNTSTDFPQMQLSL 

PSDNSKDNSTCNIEWXPMDIEES I WS PRFGLKGKIDVTVGVKI 

HRGYKTKyKIMPLELKTGKESNSIEHRSQWLYTLLSQERRADP 

EAG3^1^YLKTGQMyPV?ANHLDKRELLKLRWQMAFSLFHRI SKS 

ATRQKTQIASLPQIIEEEECTCKYCSQIGKCALYSRAVEQQMDCS 

SVPIVMLPKIEEETQHLKQTHLEYFSLWCLMLTLESQSKDNKKK 

HQNIWLMPASEMEKSGSCIGNLIRMEHVKIVCDGQYLHNFQCKH 

GAI P VTNLMAGDRVIVSGEERSLFALSRGYVKEINMTTVTCLLD 

RNl^SVLPESTLFRLDQEEKNCDlDTPIiGNLSKrjMENTFVSKKLR 

DLIIDFREPQFISYLSSVLPHDAKDTVACILKGUNKPQRQAMKK 

VLLSKDYTLIVGMPGTGKTTTICTLVRILYACGFSVLLTSYTHS 

AVDNIIiLKIAKFKlGFLRSRXQIQKVHPAIQQFTEHEICRSKSI 

KS \LALLEELYTSQL IDATTCMGINHPIFSRKI FDFCI VDEASQ 

ISQPICLGPIiFPSRRFVIiVGDHQQLPPLVriNREARALGMSESLP 

KRLEQNKSAWQLTVQYRMNSKIMSLSNKLTYEGKLECGSDKVA 

NAVINLRHFKDVKLELEFYADYSDNPWLMGVFEPNNPVCFLNTD 

KVPAPEQVEKGGVSNVTEAKLIVFLTSIFVKAGCSPSDIGIIAP 

YRQQIjKIINDIiLARSIGMVBVNTVDKYQDXRDKSIVLVSFVRSN 

KEGTVCELLKDWRRTiNTVAITRAKHKLILLGCVPSLNCyPPLEKL 

LNHLNSEKLI IDLPSHEHESLCHIL6DPQRE 


6025 


3977 


89 


GGFPAQSDHLPPVFPLRSDLLITMSTLYVSPHPDAFPSLRAIjIA 
ARYGEAGEGPGWGGAHPRICI*QPPPTSRTSFPPPRLPALEQGPG 

glwvwgatavaqllwpaglggpggsraavlvqqwvsyadtelip 
aacgatlpaiiglrssaqdpqavlgalgralspleewlrlhtyla 

GEAPTIJUJIJUlVTAIiLLPFRYVLDPPARRIWNNVTRWPVTCVRQ 

PEFRAVLGEWLYSGARPLSHQPGPEAPALPKTAAQLKKBAKKR 

EKLBKFQQKQKIQQQQPPPGEKKPKPEXREKRDPGVITYDLPTP 

PGEKKDVSGPMPDSYSPRYVEAAWYPWWEQQGPFKPEYGRPMVS 

AANPRGVFMMCIPPPNVTGSLHLGHALTNAIQDSLTRWHRMRGB 

rrLWNPGCDHAGIATQWVEKKLWREQGIiSRHQLGREAPLQEVW 

KWKBEKGDRIYHQLKKLGSSLDWDRACFTMDPKLSAAVTEAFVR 

LHEEGI lYRSTRLVNWSCTIiNSAISDI EVDKKBLTGRTLLSV^G 

YKEKVEPGVLVSPAYKVQGSDSDBEVWATTRIETMLGDVAVAV 

HPKDTRYQHLKGKNVIHPFLSRSLPIVFDEFVDMDFGTGAVKIT 

PAHDQNDYEVGQRHGLEAISIMDSRGALINVPPPFLGLPRPBAR 

KAVLVALKERGLPRGIEDNPMWPLCNRSKDWEPLLRPQWYVR 

CGEMAQAASAAVTRGDLRILPERHQRTWHAWMDNIREVWCMFPG 

KLWWG\HR\lPAyFVrVSDPAVPPGEDPDGRYWVSGRNEAEARE 

KAAKEFGVSPDKISLQQDEDVLDTWFSSGIiFPLSILGWPNQSED 

LS VFYPGTLIiETGEn?! LFFIfVARMVMLGLKLTGRLP FRE VYLHA 

IVRDAHGRKMSKSLQNVIDPLDVIYGISIiQGLHNQLLNSNLDPS 

EVEKAKEGQKRDFPAGlPECGTDALRFGrjCAYMSQGRDINLDVN 

RrLGYRHFCNKLWNATKFALRGLGKGFVPSPTSQPGGHESLVDR 

WIRSRLTEAVRLSI7QGPQAYDFPAVTTAQYSFWI.YELCDVYLEC 

LKPVLNGVDQVAAECARQTLYTCLDVCriRLLSPFMPFVTEELFQ 

RLPRRMPQAPPSLCVTPYPEPSECSWKDPEAEAALELALSITRA 

VRP\LHADYNIiHPESGPTCFrjEVAD\EATGALASAVSGYVQGPG 

QAQVWAVAEPWGLPAP\QGCAVALASDRCSI\HLQLQG\LLDP 

AREIiG\KLQ\AKRVEAQ\RQAQ\RLR\ERRA\ASGNPVKVPL\E 

VQEADEAKLQQTBAEIiRKVDEAIALFQKML 




6026 


2674 


S14 

1 


GPITFLKKKAKMKDMPLRIHVIjLGLAITTLVQAVDKKVDCPRLC 
TCEIRPWFTPRSIYMEASTVDCNDLGLLTPPARLPANTQILLLQ 
TNMIAKIEYSTDFPVWLTGr.DLSQNNLSSVTNrNGKKMPQLLSV 
YLBENKLTELPEKCLSELSNLQELYINHNLLSTISPGAFIGLHN 
DLRLHLNSNRIiOMINS KWFDALPNLEILMIGENPI IRI KDMNFK 
PLINLRSLVIAGINLTE 1 PDNALVGLENLES ISPYDNRLI KVPH 
/ALQK^AOTjKFLDIJNKWPIKRIRRGDFSNMLHLKELGINNMPEL 
r S IDSLAVDNLPDLRKIBATNNPRLS YIHPKAFFRLPKLESljKL 
^SNALSALYHGTIESLPNLKEISIHSNPIRCDCVIRWMN&dNKTN 
ERPMEPDSLFCVDPPEFQGQNVRQVHFRDMMEICLPLIAPESFP 
3MIjNVEAGSYVSFHCRATA\EPQPEIYWITPSG0iCLLPNT\LTD 
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SEQ 
ID 
NO: 


Precllcted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acaa. segment containing signal pepti"H^ 
(A=Axanine, C=Cysteine, D-Aspartic Acid, E= 
CSlutamic Acid, P»Phenyl alanine, G=Glycine, 
H==Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M-Methionine, K=AEpara9ine, 
P^Proline, Q=Glut amine, RsArginine, 
S=Serine, T-Threonine, V=Valine, 
ff^Tryptophan, Y=Tyrosine, X=Unknown, *=gtop 
Codon, /=possible nucleotide deletion, 
\=Fossible nucleotide insertion) 


r 

6027 






KFYVHSEGT1.DINGVTPKEGGLYTCIATNLVGAPLKS^^•IKVDG 
SFPQDNNGSLNIKIRDIQANSVLVSWKASSKILKSSVKWTAFVK 
TENSHAAQSAR IPSDVKVYWLTHLNPSTB YKICI DI PTI YQKNR 
KKCVNVTTKGLHPDQKEYEKNNTTTLMACLGGLLGI IGVICLI S 

CLSPEMNCDGGHSYVRNYLOKPTFALGELYPPLINLWEAGKEKS 
TSLKVKAT VI GIf PTNMS 


6028 


52S4 


4148 


GGRRAPGRfURSIKDEEEETVFREWSFSPDPLPVRYYDKDTTK 

PISPYLSSLEELLAWKPRLEDGFNVALEPLACRQPPLSSQRPRT 

LLCHDMMGGYLDDRFIQGSWQTPYAFYHWQCIDVFVYPSHHTV 

TIPPVGWTNTAHRHGVCVLGTFITEWNEGGRLCEAFLAGDBRSY 

QAVADRLVQITXRFFRFTCWLrNlENSLSLAAVGNMPPPLRYLT 

TQLHRQVPGGLVLWYDSWQSGQLKWQDELNQHNRVFPDSCDGF 

FTNYNWREEHLERMLGQAGBRRADVYVGVDVPARGNWGGRFDT 

DKVGGGPRPRASGPVPPLGPHFLMDLPFPSAPQRNDSSCSSOSG 
DPVALRNRCPAPAKLCPH 


6029 


120 


3432 


NCLIiLQAKGFHGHI EDLQQWLTDTERHLLASKPLGGLPETAKEQ 

LNVHMEVCAAFEAKEBTYKSLMQKGQQMLARCPKSAETNIDQDI 

NNLKEKWESVBTKLNER\KT\KLEEALKLA\MEFHNSL\QDFIN 

WLTQAEQTLNVASRPSLrLDTVLPQIDEHKVPANEVWSHREQII 

ELDKTGTHLKYPSQKQDWLIKNLlilSVQSRWBKWQRLVERGR 

SLDDARKRAKQFHEAWSKLMEWLSESEKSLDSELBIANDPDKIK 

TQtAQHKBPQKSl^AKHSVYDTTNRTGRSLKEKTSIiADDNLKLD 

DMLSSU?DKWDTICQKSVERQNKLEEA\Lt,FSGQFTDALQALID 

WLYRVEPQLAEDQPVHGDIDLVMIILIDNHKAFQKELGKRTSSVQ 

ALKRSARBLIEGSRDDSSMVKVQMQELSTRWETVCALSISKQTR 

LEAALROAEEFHSWHAIiLEWLAEAEQTLRFHGVLFDDEDALRT 

LIDQHKEFMKKLEEKRAELNKATTMGDTVLAI CHPDS ITTI KHW 

it: IRARPEEVIAWAKQHQQRLASALAGLIAKQEr>LEALLAWLQ 

KAETTLTDKDKEVI PQE I EEVKALIAEHQTFMEEMTRKQPDVDK 

VTKTYKRRAADPSSLQSHI PVLDKGRAGRKRPPASSLYPSGSC3T 

QIETKNPRVNLLVSKWQQVWLIALERRRKLNDALDRLBELREFA 

NPDFDIWRKKYMRWMNHKKSRVMDFFRRIDKDQDGKITRQEFID 

GILSSKFPTSRLEMSAVADIFDRDGDGYIDYYEFVAALHPNKDA 

YKPITDADKIEDEVTRQVAKCKCAKRFQVEQIGDNKYRFFLGNQ 

FGDSQOLRLVRILRSTVMVRVGGGWMALDEFI,VKNDPCRAKGRT 

NKELREKPILADGASQGMAAFRPRGRRSRPSSRGASPWRSTSVS 

SQAAQAASPQVPATTTPKILHPLTRNYGKPWLTNSKMSTPCKAA 

ECSDPPVPSAEGTPIQGSKLRIaPGYLSGKGFHSGEDSGLXTTAA 

ARVRTQFADSKKTPSRPGSRAQSKAGSRASSRRGSDASDFDISE 

IQSVCSDVETVPQTHRPTPRAGSRPSTAECPSKIPTPQHKSPASK 
liDKSSKR 




1 


3533 

I 

c 

I 
I 


XMPCGSSRDLHGCWTHPKEPVSDLSYFDCIESVMENSKVLGESM 

AGISQMAKTGDLPAPGECVGIASKALCGLTEAAAQAAYLVGI FD 

PNSQAGHQGLVDPIQFARANQAIQMACQKTLVDPGSSPSQVLSAA 

TIVAKHTSALCNACRIASSKTANPVAKRHFVQSAKEVANSTANL 

VKTIKALDGDPSEDNRNKCRrATAPLIEAVENt.TAFASNPEPVS 

IPAQISSEGSQAQEPILVSAKPMLESSSYLIRTARSLAINPKDP 

PTWSVIAGHSHTVSDSIKSLITSIRDKAPGQRECDySIDGINRC 

IRDIEQASIiAAVSQSIiATRDDISVEALQEQLrSWQEIGHLIDP 

lATAARGEAAQLGHKGTQLASYFEPLILAAVGVASKILDHQQQM 

TVLDQTKTLABSALQMLYAAKEGGGNPKAQHTEIDAITEAAQLMK 

EAVDDIMVTLNEAASEVGLVGGMVDAIAEAMSKLDEGTPPBPKG 

rPVDYQTTWKYSKAtAVTAQEMMTKSVTNPEELGGLASQMTSD 

iTGHtAFQGQMAAATAEPEEIGFQIRTRVQDLGHGCIFLVQKAGX 

U^QVCPTDSYTKRELl ECARAVTEKVSLVLSALQAGNKGTQACI 

rAATAVSGI I ADLDTTI MFATAGTLNABWSETPADHRENI LKTA 

CAIiVEDTKLLVSGAASTPDKLAQAAQSSAATITQIiAEWKLGAA 

JLGSDDPETQWLINAIKDVAKALSDLISATKGAASKPVDDPSM 

fQLKGAAICVMVTNVTSLL.KTVKAVEDEATRGTRALEATIECIKQ 

JLTVFQSKDVPEKTSS PEES IRMTKGITMATAKAVAAGNSCRQE 

)VIATANLSRKAVSDMLTACKQASPHPDVSDEVRTRALRFGTEC 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

cor re spond i ng 

to first 

amino acid 

residue of 

amino acid 

sec[uence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
tA=»Alanine, C=Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, F= Phenyl alanine, G«Glycine, 
H=:Histidine, I=lBoleucine, K=Iiysine, 
L=Leucine, M=Methionine, N-Asparagine , 
Psproline, Q=Glutamine, R=Arginine, 

W=Tryptophan, Y=Tyrosine, X=Unknown, *aStop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








TLGYLDLLEHVLVILQKPTPELKQQLAAFSKRVAGAVTELIQAA 
EAMKGTE WVDPEDPTVXAETELL6AAAS lEAAAKKLEQLKPRAK 
PKQADETLDFBBQILEAAKSIAAArSALVKSASAaQRELVAQGK 
VGSIPANAADDGQWSQGLISAARMVAAATSSLCEAAWASVQGHA 
SEEKLlSSAKQVAASTAQIiLVACKVKADQDSEAMRRLQAAGNAV 
KRASDNLVRAAQKAAFX3KADDDDVWKTKFVGGIAQI lAAQEBM 
LKK3RELEEARKKLAQIRQQQYKFLPTELREDEG 


6030 


3 


1777 


FPGRGSPALQLE VLI CLGLMGLERALNVIiAP 1 FYRNI VMLLTEN 
APWNSLAWTVTSYVFIiKFLQGGGTGSTGFVSNLRTFLWIRVQQF 
TSRRVELLIFSHIiHEIiSLRMHriGRRTGEVLRIADRGTSSVTGLL 
S YLVFNVI PTLADI 1 1 GI I YFSMFFNAWFGLI VFLCMSLYLTLT 
IVVTEWRTKFRRAMNTQENATRARAVDSLLNFETVKYYWAESYE 
VER YREAI I KYQGLEWKSSASLVLLKQTQNLVrGLGLLAGSLLC 
AYFVTEQKLQVGDYVLFGTYIIQLYMPLNWPGTYYRMIQTNPID 
MENMFDLLKK\STEVKDLPGAGPFRFQKGRIEFENVHPSYADGR 
BTLQDVSPTVMPGQTLALVGPSGAGKSTILRLIiFRFYDISSGCI 
RIDGQDISQVTQALPRFSHWELCPKDTVLPNDTIADNIRYGRVT 
AGrJDEVEAAAQAAGIHDAIMAFPEGYRTQVGERGLKLSGGEKQR 
VAIARTILKAPGI ILLDEATS AIiDTSNERAl CZASLAKVCAWRTT 
1WAHRLSTWNADQII»VIKDGCIVERGRHEALLSRGGVYADMW 
QLQQGQEETSEDTKPQTMER 


6031 


160 


16^4 


LRMSENLDKSNVNEAGKSK^iTDSEEGLEnAVEGADEALQKAIKa " ' 

DSSSPQRVQRPHSSPPRFVTVEELLETARGVTNMALAHEIWNG 

DPQI KP VELPENSIiKKRVKE I VHKAPWDCLS VQIiS EDPPAYDEiA 

IKItVGBIKETIiLSFIiLPGHTRLRKQITEVLDLDIiIKQfiAENGAL 

DIS KLAEFI IGMMGTIjCAPARDEB VKKLKDI KEXVPLFREIFS V 

LDLMKVDMANFAISS IRPHLMQQS VEYERKKFQEI IjE3?QPNS LD 

FVTQWLEEASEDLMTQKYKHALPVGGMAAGSGDMPRLSPVAVQN 

yAYLKLLKWDHLQRPFPETVLMDQSRFHELQLQ\REQLTILGAV 

LLWFSMAAPGISSQADFAEKLKMIVKILJLTDMHLPSFHLKDVI, 

TTIGEK7CLEVSSCLSLCGSSPFTTDKETVLKGQIQAVASPDDP 

IRRIMESRILTFLETYLASGHQKPI/PTVPGGLSPVQRELBEVAI 

KFARLVNYNKMVFCPyyDAILSKILVRS 


€032 


39 


2415 


aarlcraqptksawmirdlskmypqtrhpaphqpaqpfkftise 
scdrikeefqfiioaqyhslkiieceklasektemqrhyvmyyems 
if gln i emhkqae i vkrlnai caqvi p fls qehqqq wqavbrak 
qvtmaelnaiigqqqlqaqhlshghglpvpltphpsglqppaip 
pigssagllalssaiiggqshlpl kdekkhhdktdhqrdrdsi kss 
svspsasfrgaekhrnsadysseskkqktbekeiaarydsdgek 
sddnlwdvsnedpssprgspahsprbngldktrllkkdapisp 

ASIT^SSSTPSSKSKEIiSLNEKSTTPVSKSNTPTPRTDAPTPGS 
NSTPGLRPVPGKPPGVDPLAS SLRTPMAVPCPYPTP FGI VPHAG 
MNGELTSPGAAYAGIiHNISPQMSAAAAAAAAAAAYGRSPWGFD 
PHHHMRVPAI PPNLTGI PGGKPAYSPHVSADGQMQPVPPPPDAL 
I GPG I PRHARQINTLNHGE WCAVTISNPTRHVYTGGKGCVKVW 
DISHPGNKSPVSQLDCIiNRDNYIRSCRIjIiPDGRTLIVGGEASTL 

s iwdlaaptprikaeltssapacyaxiai 3 pdskvcfsccsdgni 
avwdlhnqtlvrqfqghtdgas cidisndgtklwtggldntvrs 

W \DLREGRQLQQHD/ FFTSPVFSLGYCP\ TEEWIiAVGMENSN\ V 

eviihvtkpdkyqlhlhbscvlslkpahcokwp\vstgkdnllna 
ratvyeviy 


6033 


39 


241S 


AARLCRAQPTKSAWMIRDLSKMYPQTRHPAPHQPAQPFKFTISE " 
SCDRIKEEFQPLQAQYHSLKLECEKIiASEKTEMQRHYVMYYEMS 
YGLNIEMHKQAEIVKRIiNAXCAQVIPFIjSQEHQQQVVQAVERAK 
QVTMAELHAI IGQQQLQAQHLSHGHGLPVPLTPHPSGLQP PAI P 
PIGSSAGLLALSSALGGQSHIiPIKDEKKHHDNDHQRDRDSIKSS 
SVS PSAS FRGAEKHRWSADYSSESKKQKTEEK3IAARYDSDGEK 
SDDNLWDVSNEDPSSPRGSPAHS PRENGLDKTRIiLKKDAPIS P 
ASIASSSSTPSSKSKELSLNEKSTTPVSKSNTPXPRTDAPTPGS 
NSTPGLRPVPGKPPGVDPLASSIiRTPMAVPCPYPTPFGIVPHAa 
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SEQ 
ID 
NO: 


Pretiicted 
beginning 
nucleotide 
location 
corresponding 
to first 
aniino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acxd segment containing signal peptiHe" 
(A^Alanine, C=.Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H=Histidinc, I=l3oleucine , K=Lysine, 
LaLeucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutainine, R-Arginine, 
Ss=Serine, TaThreonlne, valine, 
W=iTryptophan, Y=Tyrosine, X^^Unfcnown, *feStop 
Codon, /=poasible nucleotide deletion. 


6034 






HMiiJiI,TSPGAAYAGLHNISPQ|yiSAAAAAAAAAAAYGR5PWGFD 

PHmiMRVPAIPPNI.TQIPGGKPAYSFHVSAD3QMQPVPFPPDAL 

IGPGIPRHARQINTLWHGEWCAVTISNPTRttVYTGGKGCVKVW 

DISHPGWKSPVSQI^DCLNRDWYIRSCRLLPDGRTLIVGGEASTL 

SIWDLAAPTPRIKAELTSSAPACYAUMSPDSKVCFSCCSDGNI 

AVmDLHNQTLVRQFQGHTDGASCIDISNDGTKLWTGGLDNTVRS 

W\DLREGRQLQQHD/FFTSPVFSLGYCP\TEEWLAVGMENSN\V 

EVLHVTKPDKYQLHLHESCVLSLKFAHCGKWF\VSTGKDNLLWA 

ff\RTpyG\ASIF\QSK3SSS\VLSCDl\SVDDKYIVTGS\GDK\ 
RATVYBVIY 


6035 


2683 


714 


ESGRRRRliPJUy^SPCPGTAGGPGETNPGPGACPHGPREEAAAAM 
EIAPQEAPPVPGADGDIEEAPAEAGSPSPASPPADGRLKAAAKR 
VTFPSDEDIVSGAVEPKDPWRHAQNVTVDEVlGAYKQACQKIiNC 
RQIPKLLRQLQEPTDLGHRLDCLDLKGEKLDYKTCEAIiEEVPKR 
LQFKWDLEQTNLDEDGASALFDMIEYYESATHLNISFNECHIGT 

rgwqaaahmmrktsclqylNdarntplldhsapfvaralrirss 

IAVLHLENASLSGRPIiMLLATALK>INmLREI,YL\ADWKU^GLQ 
DSAQLGNLLKFNCSLQILDLRNKHVLDSGLAYXCBGLKBQRKGIj 
VTLVVLWNNQLrHTGMAFLGMTLPHTQSLETLNLGHNPIGNEGV 

rhlknglisnrsvlrlglastkltcegavavaefiaesprllrl 

DLRENEIKTI^LMALSIALKVNHSLLRLDLDREPKKEAVKSFIE 

tqkallaeiqngckrnlvlarereekeqppqlsasmpettatep 
qpddepaagvqngapspapspdsdsdsdsdgeeeebeegereet 
psgaidtrdtgssepqpppepprsgpplpnglkpefaiai,ppbp 
ppgpevkggscglehelscsknekeleellleasqbsgqetl 


6036 


13 


404 


SVTYLGIHOiKNTGALPADPVQLISQTPTPSTKQQI^LSPLGMVG 

yfylwipgfailtkplckltkeniadaidpkspshssfrslkta 

LENASTTiALPDSSQPFNSLHTAEVQGCWElLTQGLGPLPV 


6037 


1745 


tTE 


i^pdvefclgrrrgrkmdsvekgaatsvsnprgrpsrgrppklqrn 

SRGGQGRGVBiCPPHrAALILARGGSKGIPLKNIKHriAGVPLIGW 

vlraaldsgafqs vwvstdhdeienvakqfgaqvhrrss e vskd 
sstsldaiieplnyhnevdivgniqatspclhptdlqkvaemir 
eegydsvfswrrhqfrwseiqkgvrevteplnlnpakrprrqd 
wdgelyengsfyfakrhliemgylqggkmayyemraehsvdidv 
didwpiaeqrvlrygyfgkeklkeikxlvcnidgcltoighiyvs 
gdqkeiisydvkdaigisllkksgievrliseracskqtlsslk 

AjuK^&j»i]*. v£> v&xjkJjAVVDE wrkemglcwkevaylgnevsdbeclk 

RVGLSGAPADACSTAQKAVGYICKCNGGRaA\lRBFAEHIC\LL 
MBKGLINFMPKNRNIiAVNIGEKK 


6038 


2936 


1919 


WTSWWMSSVLTHiLFSLQGNKMLNYSAPSAGGYLLPRKPVGTPA 
GGGFPRRHSVTLPSSKFRQNQLLSSLKGBPAPALSSRDSRFRDR 
SFSEGGERLLPTQKQPGGGQVNSSRYKT\ELCRPFEENGACKYG 
DKCQPAHGIHELRSLTRHPKYKTELCRTFHTIGFCPYGPRCHFI 
HNAEERRALAGARDLSADRPRLQHSFSFAGFPSAAATAAATGLL 

DSPTSITPPPILiSAr>r>T.T/3QD'Pr "Drw^T^KThTnotV 

PSMGLPGGGSPTTFLFRPMSESPHMPDSPPSPQDSLSDQEGYLS 
SSSSSHSQ3DSPTLDNSRRLP I FSRLS I SDD 


6039 


1450 


426 

< 


SSALQEFGTRWHTF6VPLPHRRKQIISCNXCQLRFNSDSQAAAH 
YKGTFCHAKKLKALEAMKNKQKSVTAKDSAKTTFTSITTNTINTS 
SDKTDGTAGTPAISTTTTVEIRKSSVMTTEXTSKVEKSPTTATG 
KSSCPSTETEEBKAKRLL\YCSLCKVAVWSASQLEAHNSGTKHK 
rWLEARKGSGTIKAFPRAGVKGKGPVNKGNTGLQNKTFHCEICD 
raVNSETQLKQHlSSRRHKDRAAGKPPKPKYSPYNKLQKTAHPL 
SVKLVFSKEPSKPLAPRILPNPIiAAAAAAAAVAVSSPPSLRTAP 
EiATLFQTSALPPAliLRPAPGP IRTAHTP VLPAPY 




4073 


lOOO ] 

1 
I 

. \1 


L.DEyEARLTLAIVX,DDFEEDNEDDDENRVNQEBKAAKITELINKL 
JPLDEAEKDLATVNSNPFDDPDAAELNPFGDPDSEEPITETASP 
IKTEDSPYNNSYNPFKEVQTPQYLNPFDEPEAFVTIKDSPPQST 
CKKNIRPVDMSKYLYADSSKTEBEELDESNPFYEPKSTPPPNNL 
WPVQBLETBRRVKRKAPAPPVLSPKTGVLNENTVSAGKDLSTS 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
secjuence 


Amino acxa segment containing signal pepti^Hi^ 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G:=Glycine, 
H»Histidine, I-Isoleucine, K=Lysine, 
L=:Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
SaSerine, T=Threonine, V= Valine, 
W=Tryptophan, Y==Tyrosine, X^Unknovm, *=.Stop 
Codon, /=sposeible nucleotide deletion, 
\»pOBsible nucleotide insertion) 


6040 






PKPSPIPSPVLgRKPNA3QSLLVWCKEVTKNYRGVKIl"MFTTSW~" 

RKGLSPCAILHHFRPDLIDYKSLNPQDIKENKTKKAYDGFASIGI 

SRLLBPSPMVLLAIPDKLTVMTYLYQIRAHFSGQELJTWQISEW 

SSKSTYKVGNYETDTNSSVDQEKPYAKLSDLKREPELQQPISGA 

VDFLSQDDSVPVNDSGVGESESEHQTPDDHLSPSTASPYCRRTK 

SDTEPQKSQQSSGRTSGSDDPGICSNTDSTQAQVLLGKKRLLKA 

ETLEI.SDLYVSDKKKDMSPPFICEETDEQKLQTLDIGSNLEKEK 

LENSRSIiECRSDPESPIKKTSLSPTSKLGYSYSRDLDLAKKKHA 

SLRQTESDPDADRTTLNHADHSSKIVQHRLLSRQEELKERARVL 

LEQARRDAALKAGNKHNTNTAAPFCNRQLSDQQDEERRRQLRER 

ARQLIAEARSGGKMSELPSYGERAAEKLKERSKASGDENDNIEI 

DTNEEIPEGFWGGGDELTNLENDLDTPBQNSKLVDLKLKKLLE 

VQPQVANSPSSAAQKAVTESSEQDMKSGTEDLRTERLQKTTERF 

RKPWPSKDSTVRKTQLQSPSQYIENRPEMKRQRSIQEDTKKGN 

EEKAAITETQRKPSEDEVLNKGFKDSSSQYWGELAALENEQKQ 

IDTRAALVEKRLRYLMDTGRNTEEEEAM^5QEWFMDVNKK1VIALIR 

RKNQLSLLEKBHDLEHR YELLNRELRAMiAI EDWQKTEAQKRRE 

qlllde:jvalvwkrdalvrdi.daqekoaeebdehlertleqnkg 

KI^IAKKEE KCVLQ 


6041 


475 


1052 


PTALMTAPSUAFPVQFRQPSVSGLSQITKSLYISNGVAANNKLM 
IiSSNQirMVINVSVBWNTLYEDIQYMQVPVADSPNSRLCDFFD 
PIADHIHSVEMKQGR\TI.LHCAAGVSRSAALCIAYi:,MKYHAMSL 
LDAKTWTKSCRPIIRPNSGFWEQLIHYEFQLFGKNTVHMVSSPV 
GMIPDIYEKEVRLMIPL 


6042 " 


2 


3B86 

] 


TEKDEKTAHNI,ENVLIHFWERLSEICVAKISEPEADV1ESVLGVS 

NLrcVLQKPKGSLKSSKKKWGKVRFADEILESNKENEKCVSSEG 

EKIECWELTTEPSLTHNSSGLLSPUIKKPLEDLVCKLADISINY 

VNERKSEQHLRFLSTLLDSPSSSRVFKMLLGDEKQSIVQAKPLE 

lAKLVQKNPAVQFLYQKLIGWLNEDQRKDFGFLVDILYSALRCC 

DNDMERKKVLDDLTKVDLKWNSLLKI lEKACPflSDIOiALVTP WI, 

KGDILGEKLVNIiADCliCNEDLESRVSSESHFSERWTLLSLVLSQ 

HVKNDYLrGDVYVERnVRMETIjFKTKlKLSEAESSDSSVSFrC 

DVAYNYFSSAKGCLLMPSSEDLLLTLFQLCAQSKEKTHLPDFLI 

CKLKNTWUSGVNLLVHQTDSSYKBSTFLHLSALWLKNQVQASSL 

DINSLQVLLSAVDDLLNTLLESEDSYLMCVYIGSVMPNDSEWEK 

MRQSLPMQMLHRPLLEGRLSLWYSCFKTDFKEQDIKTLPSHLCT 

SALLSKMVLIAliRKETVLENWELEKIIAELLYSLQWCEELDNPP 

IFLIGFCBILQKMNITYDNLRVLGMMSGLLQLLFNRSREHGTLW 

Shi X AKLI LSRS IS SDEVKPHYKRKES FFPLTEGWLHTIQSLCP 

FLSICEEKKEFSAQCIPALLGWTKKDLCSTNGGFGHLAIPMSCLQ 

TKSIDDGELLHGILKIIISWKKEHEDIFLFSCNLSEASPEVLGV 

NlEXIRFLSLPLKYCSSPLAESEWDFIMCSMLAWLETTSENQAIi 

YSIPLVQLFACVSCDLACDLSAPFDSTTLDTIGNLPVNLISEWK 

EFPSQGIHSLLLPlLVTVTGENKDVSETSPQNAMLKPMCErLTY 

ISKEQLLSHKIiPARLVADQKTNLPEYIiaTLLNTriAPLLLFRARP 

VQIAVYHMLYKLMPELPQYDQDNLXSYGDEEEEPALSPPAALMS 

LLSIQBDLLENVLGCIPVGOIVTIKPLSEDFCYVLGYLLTWKLI 

LTFFKAASSQLRALYSMYLRKTKSLNKLIjYHLFRLMPENPTYAE 

TAVEVPNKDPKTPPTEELQLSIRETTMLPYHIPHLACSVYHMTL 

kdlpamvelwmnssekrvfnivdrftskyvssvlsfqeissvqt 

STQLFNGMTVKARATTREVMATYTIEDIVIELIIQDPSNYPI/3S 
IIVESGKRVGVAVCK3WRNWMLQLSTYLTHQNGSIMEGLALWKNN 
TOKRFEGVEDCMXCFSVIHGFNYSLPKKACRTCKKKFHSAVCLY 
KWFTSSNKSTCSLiCRETFF 




1306 


253 f 
( 
C 
I 
C 
I 

c 


«laeiapasp4>dikasvsngdrtllcsrrqscgmnevr0vsltyp~ 

3spapshslplqprsggslcpsraw/pdphqlpddtssaqsrqy 

saqrapgglsypaasptphaapladpvsnmamaygsslaaqgke 

.VDKNIDRFIP ITKLECYYFAVDTKYVGRKLGLLPPPYLHQDWBV 
SYQODTPVAPRFDVNAPDLYIPAMAPITYVLVAGIALGTQDRFS 
>DLLGLQASSAIAWLTLEVlAILiLSLYLVTVNTDLTTrDLVAFL 
JYKYVGMIGGVLMGLLPGKIGYYLVLGWCCVAIFVFMIRTLRLK 
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SBQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
cimino acid 
residue of 
amino acid 
sequence 


-i- nj. t- (r u enu 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segtnenc containing signal peptid~ 
(A«Alanine, C=Cyeteine, D=Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L«Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=3erine, T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\ -possible nucleotide insertion) 


6043 


403 


599 


^liADAAAECSVPVRGARNQIiRMYLTMAVAAAQPMIiMYWLTFHLVR 
LCLFFPFPCATPVLPr.PSLISAT./CI;SHLSVSSWFCPCQPPLPC 
PBPPLQNKTAKGSLSTEQSERG 


6044 


793 


412 


KLEMWNPTi.XSKVKISREVTMIASKFGIGQQVRHSLLGyLGWV 
DIDPVYSLSEPSPDELAVNI>BLRAAPWYH^A^BDDNGLPVHTYL 

aeaqlsselqdehpXeqpsmdelaqtirkqlqaprlrn 


S045 


155 


2299 


splpqvaamnyiirrrlsdskfmanlpkgymtdiiqrpqppppppg'" 
ahspgatpgpgtataerssgvapaaspaapspgssggggffssl 

SMAVKQTTAAAT^TFSEQVGGGSGGAGRGGAASRVLLVIDEPHT 
DWAKYFKGKKIHGBIDIKVEQAEPSDLNLVAHANGGFSVDMEVIi 
RNGVKWRSLKPDFVLIRQHAFSMARNGDYRSLVIGLQYAGIPS 
VNSLHSVYNFCDKPWVFAQMVRLHKKLGTEEFPLIDQTFYPNKK 
EMliSSV'ITYPVWKMGHGTLWGWGKVKVDNQHDFQDIASVVALT 
KTYATAEPFIDAKYDVRVQKIGQNYKAYMRTSVSGNWKTNTGSA 
MLEQIAMSDRYKLWVDTCSEI FGGLDICAVEALHGKDGRDHI IE 
WGSSMPLlGDHQDEDKQLIVELWNKMAQALPRQRQRnASPGR 
GSHGQTPSPGALPLGRQTSQQPAGPPAQQRPPPQGGPPQPGPGP 
QRQGPPLQQRPPPQGQQHLSGLGPPAGSPLPQRLPSPTSAPQQP 
ASQAAPPTQGQGRQSRPVAGGPGAPPAARPPASPSPQRQAGPPQ 
ATRQTSVSGPAPPKASGAPPGGOOROGPPOKPPGPAGPTPnaQO 
AGPVPRTGPPTTQQPRPSGPGPAGRPKPQLAQKPSQDVPPPATA 

AAGGPPHPQLNKSQSLTNAFNLPEPAPPRPSLSQDEVKAETIRS 
LRKSPASLFSD 


6046 


212 


1075 


EGLTGPCERVPFLLGRGPPHGATRAGHRRAVRWAGPESLPPLPR 
SLlMDSPRAGTHQGPLDAETEVGADRCrSTAYQEQRPQVEQVGK 
QAPLSPGLPAMGGPGPGPCEDPAGAGGAGAQQSEPLVTVTVQCA 
PTVALRARRGADIiSSIiRALLGQALPHQXAQLGQLSYIiAPGEDGH 
WVPIPESESLQRAWQDAAACPRGLQLQCRGAGGRPVLYQWAQH 
SYSAQGPEDLGFRQGDTVDVLCEVDQAWLEGKCDGRIGIFPKCF 
WPAGPRMSGAPGRLPRSQQGDQP 


6047 


49 


1405 


PVLVTSIjRWREADTliRPPQLMEVSADI ISTVEPKrHTRPT t JXTrri 
KC-GRWIFQREPESKNAPHSQGEYDVYSTFQSHBPEFDYLKSLE 
lEEKINKIKWLPQQNAAHSLLSTNDKTIKLWKITERDKRPEQVN 
LKDEEGKIJCDLS'rVTSLQVPVLKPMDLMVEVSPRRrPTiJCGHTYH 
INS IS VNSDCETYMSADDLRINLWHLAITDRSFTP \NI VDIKPA 
NMBDLTEVITASEFHPHHCNLFVYSSSKGSLRLCDMRAAALCDK 
HSKLFEEPEDPSNRSFFSEIIS\SVSDVKFSHSDRYMLTR\DYL 

tvkvwdlXnmbarpietyqvhdylrsklcslyendci FDKFECA 

WNGSDSVIMTGA\YNNPFRMPDRKTKRDVTL\EASRESSKPRAV 
LKPRRVCVGGKRRRDDISTOSLDFTKKILHTAWHPAENIIAIAA 

tnnlyi fqdkvnsdmh 


6048 


1 


3194 

1 


GIRTPKFCDSPTSDLEMRNGRGRGKRMRPNS^3TPVNETATASDS 

KGTSNSSKTRAGAWSKGRRGSQNSSEHRPPASSTSEDVKASPSS 

ANKRKNKPLSDMELNSSSEDSKGSKRVRTNSMGSATGPLPGTKV 

EPTVLDRNCPSPVLIDCPHPNCWKKYKHINGLKYHQAHAHTDDD 

SKPEADGDSEYGEEPILHADLGSCWG\ASVSQK\GSLSPARSAT 

PKVRLVEPHgpSPSSKFSTKGLCKKKLSGEGDTDLGALSNDGSD 

DGPSVMDETSNDAFDSLERKCMEKEKCKKPSSLKPEKIPSKSliK 

SARPI/APLAIPPQQIYTPQTATPTAASPGSSSGLTATVAQAMP 

NSPQLKPIQPKPTVMGEPFTVNPALTPAKDKKKKDKKKKESSKE 

IjBSPLTPGKvCRAEEGKSPFRESSGNGMKMEGLIjNGSSDPHQSR 

liASIKAEADKIYSFTDNAPSPSIGGSSRLENTTPTQPLTPLHVV 

TQ>rGAEASSVKTNSPAYSDISDAGEDGEGKVDSVKSKDAEQI,VK 

EGAKKTLPPPQPQSKDSPYYQGPESYYS PS YAQSS PGALNPSSQ 

RGVESQALKTKRDEEPESIEGKVKNDICEEKKPELSSSSCX3PSV 

IQQRPKMYMQSLYYNQYAYVPPYGYSDQSYHTHLLSTNTAYRQQ 

STEEQaKRQSLBQQQRGVDKKAEMGLKEREAALKEEWKQKPSipp 

rLTKAPSLTDLVKSGPGKAKEPGADPAKSVIIPKLDDSSKLPGQ 

tiPEGLKVKLSDASHLSKEASEAKTGAECGRQAEMDP I LWYRQEA 

SPRMWTYVYPAKYSDIKSEDERWKEERDRKLKEERSRSKDSVPK 
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r SEQ 
ID 
NO: 


Prediccect 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acaa segment containing signal peptide"" 
CA=Alanine, C=Cyste4.ne, D«.A3partic Acid, B= 
Glutamic Acid, F^Phenyl alanine, G==Glycine, 
H-Histidine, I-Isoleucine, K^Lysine, 
L«Leucine, M=Methionine , N^Asparagine, 
PsProline, Q==Giutamine, R»Arginine, 
o-oerane, i=Tnreonine, V=: Valine, 
W=Tryptophan, Y«Tyrosine, X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6049 






JgDGKESTSSUCKLPTSEESRLGSKEPRPSVHVPVaSPLTQHOSY" 
IPYMHGYSYSQSYDPNHPSYRSMPAVMMONyPGSyiiPSSYSFSP 
YGSKVSGOEDADKARASPSVTCKSSSESKALDILQQHASHYKSK 
S PTISDKTSQERDRGGCGWGGGGSCS S VGGASGGERSVDRPRT 


5050 


215 


10S9 


AMTQVFDRRVPSIRSGUFQAPFQTSAAMHHPSUKSPTLPESSAT" 

dsdyysptggaphgycsptsasygXkalnpyqyqyhgvkgsags 

YPAKAYADYSYASSYHQYGGAYNRVPSATNQPEKEVTEPEVRMV 
NGKPKKVRKPRTiySSPQIAALQRRFQKTQYLALPERAELAASL 
GLTQTQVKXWFQNKRSKlKKIMKhTGEMPPEHSPSSSDPMACNSP 

ospavwepqgssrslshhphahpptskqspassylensaswyts 

AASSINSHLPPPGSLQHPIiALASGTLY 


6051 


566 


1718 


KGLBRTCCAMEHSDSBKTTEKENLGPRMDPPLGEPG \QSLGWVL " 

PNTAMKKKVLLMGKSGSGKTSMRSIIFANYIARDTRRLGATILD 

RIHSLQINSSIiSTYSLVDSVGNTKTFDVEHSHVRPLGNLVLNLW 

DCGGQDTFMBNYFTSQRDWIFRNVEVLI Y VFDVESRELEKDMHY 

YC3SCLEAILQNSPDAKIFCLVH1CMDCVQEDQRDLIPKEREEDLR 

RLSRPLECSCFRTSIWDETLYKAWSSIVYQLIPNVQQLEMMLRN 

PAEIIEADEVLLPERATPLVISHYQCECEQRDAHRFEKISNIIKQ 

FKLSCSKIAASFQSMEVRNSNPAAFIDIFTSNTYVMWMSDPSI 

PSAATLIKIRNARKHPEKLERVDGPKQCLLMR 


J 6052 


566 


1718 


KUl.ERTCCAMKJi:SDSBKTTEKENLG?RMDPPI>GEPG\GSI.GWVL " 

PNTAMKKKVLLMGKSGSGKTSMRSIIFANYIARDTRRLGATILD 

RIHSLQINSSLSTYSLVDSVGNTKTPOVEHSHVRFLGNLVLNLW 

DCGGQDTFMENYFTSQRDNIFRWVEVLIYVFDVESRELEKDMHY 

YQSCLEAILQNSPDAKIFCLVHKMDLVQEDQRDLIPKEREBDLR 

RLSRPLECSCFRTSIWDETLYKAWSSIVYQLIPNVQQLEMNLRN 

FAEIIEADEVLLFERATPLVISHYQCKEQRDAHRFEKISNIIKQ 

FKLSCSKLAASPQSMEVRNSNPAAFIDIFTSNTYVMWMSDPSI 

PSAATIirNIRNARKHPBKLERVDGPKQCLLMR 


6053 


566 


1718 


KGIiERTCCAMiSESDSEKri'KKSNI^PRMDPPLGEPGVCiSI^W^^ 

FNTAMKXKVLLMGKSGSGKTSMRSIIPANYIARDTRRLGATILD 

RIHSI.QINSSLSTYSLVDSVGNTKTFDVEHSHVRFLGNLVLNLW 

DCGGQDTFMENYFTSQRDNIFRNVEVLXYVPDVESRELEKDMHY 

YQSCLEAILQNSPDAKIFCLVHKMDLVQEDQRDLXFKBREEDLR 

RLSRPLECSCFRTSIWDETLYKAWSSIVYQLIPNVQQLBMNLRN 

PAEIIEADEVLLFERATPLVISHYQCKEQRDAHRFEKISNI IKQ 

FKLSCSKIiAASFQSMEVRNSNFAAFIDI FTSNTYVMWMSDPSI 

PSAATLINIRNARKHFEiCLERVDGPKQCLLMR 


6054 


201 


1704 


Kl>X£MUKSRWQSRRRHGRRSHU(3NPWFRLRDSBDRSDSRAAQPA " 

HDSGHGDDBSPSTSSGTAGTSSVPELPGFYPDPEKKRYFRLLPG 

HNNCKPLTKESIRQKBMESKRLRLLQEEDRRKKIARMGFNASSM 

LRKSQLGPLNVTNYCEIIAHELRLSCMBRKKVQIRSMDPSALASD 

RFNLILADTNSDRLFTVNDVTVGGSKYGI INLQSLKTPTLKVFM 

HEWLYFTtTRKVNNSVCWASIiiraLDSHILLCLMGLAETPGCATLI, 

PASLFVWSHPAGIDRPG\MLCSFRIPGAWSCAWSLBIIOANNCFS 

rGLSRRVLLTWVVTGHRQSPGrNSDVIAQQFAr4P^LLFNGCR^ 

S3IPAIDLRCGNQGKGWKATRLFHDSAVTSVRILQDEQYUyiASD 

MAGKIKLWDLRTTKCVRQYEGHVNEYAYLPtiHVHEEEGILVAVG 

aDCYTRIWSLHDARLLRTIPSPYPASKADIPSVAFSSRLGGSRG 

\PGLLMAVaQDLYCYSYS 




1 


1054 ] 
I 
\ 

C 
I 

V 

p 


rr J.rtf^.^arwSRRHMflAPSGVHLLVRRGSHRlFaS'PIJJHIYIJ^ ^ 

CQSSSQQRHNFFFRRQRDISHSIVLPAAVSSAHPVPKHIKKPDY 

TOIVPDWGDSIEVKNBDQIQGLHQACQLARHVLLLAGKSLKV 

)MTTEEIDALVHREIISHNAYPSPLGYGGPPKSVCTSVNNVLCH 

JIPDSRPLQDGDIINIDVrVYYNGYHGDTSETFLVGKVDECGKK 

iVEVARRCRDEAIAACRAGAPFSVIGNTISHITHQNGFQVCPHF 

^GHGIGSYFHGHPEIWHHANDSDLPMBEGMAFTIEPIITEGSPE 

KVLEDAWTWSriD/TSKVSAQFEHTVUTSRGAQILTKIiPHEA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amiao acid segment containing signal peptide" 
[A=Alanine, C=Cysteine, D=Aspairtic Acid, Ea 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Hist idine , 1=1 soleucine , K^Lysine , 
I»«Leucine, M=Methionine, N=Asparagine, 
P«Proline, Q^Glut amine, R=Arginine, 
S=Serine, T=Threonine, V«Valine, 
W=Tryptophan, Y==Tyrosine, X:=Unknown, *=^stop 
codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6055 


421 


2364 


PPYFLLSFIAWWLYGQSDRTETDISQSAGPPPGTLQCSAI^HHDP 
GCT^CSRFCRDCSPPACQCHTHVPPGNALNG^/QPPELSRTLALI 
SSREPPRKKKKSQTETGKERERTSPLTQGGKRPELQHGLAGZCM 
TLLITGDSIVSAEAVWDHVTMANREIiAPKAGDVIKVIiDASNKDW 
KWGQIDDEEGWFPASFVRIjWVNHEaEVEEC!P5invnwr'i4T noMc-n 

CLCLGRPLQKRDQHRANVINE IMSTERHY I KHLKDICEGYLKQC 
RKRRDMFSDEQLKVIFGNIEDIYRFQMGFVRDLBKQYNNDDPHL 
SEIGPCFLEHQDGFWIYSEYCNNHLDACMELSKU^KDSRYQHFF 

eacrllqqmidiaXidgflltpvqkickyplqlaellkytaqdh 

SDYRYVAAALAVMRNVTQQINERKRRLENIDKIAQWQASVLDWE 
GBDILDRSSEIiIYTGE^^AWrYQP\YGRNQQRVFFLFDHQMVLCK 
KDLIRRDILYYKGRIDMDKYEWDIEDGRDDDFNVSMKNAFKLH 
NKETEEIHr.FFAKICLEEKIRWLRAFREERKMVQSDEKIGPEISE 
NQKRQAAMTVRKVPKQKGVNSARSVPPSYPPPQDPLKHGQYLVP 
\DGIAQ3QVFEFTEPKRSQSPFWQNPSRLTPFKK 


6056 


43 


3358 


SGGRGPVRVRSEQLSPSAEQVSQISQISLGRRPr.SSLPPPPSRA" 

LAPTRAPDTALTIMEVAEVESPBNPSCKIMTFRPSMEBFREFNK 

YLAYMESKGAHRAGIiAKVI PPKE WKPRQCYDDIDNLLI PAPIQQ 

MVTGQSGLFTQYNIQKKAMTVKEFRQLANSGKYCTPRYLDYEDL 

ERKYWKNIiTFVAPIYGADINGSIYDEGVDEWNIARLNTVLDWE 

EECGI S I EGVNTPy LYFGMWKTTFAWHTEDMDLYSXNYLHPGEP 

ICSWYAIPPEHGKRLERIAQGFFPSSSOGCDAFLRHKMTLISPSV 

LKKYG I PFDKI TQEAGEFMIT PP YGYHAGFNHGFNCAES TWFAT 

VRWIDYGKVAKLCTCRKDMVKISMDIFVRKFQPDRYQLWKQGECD 

lYTIDHTKPTPASTPEVKAWLQRRRKVRKASRSFQCARSTSKRP 

KADEEEEVSDBVDGAEVPNPDSVTDDLKVSEKSEAAVKLRWTEA 

SSEEESSASRMQVEQNLSDHIKLSGNSCliSTSVTEDIKTEDDKA 

YAYRSVPSISSEADDSIPLSTGYEKPEKSDPSELSWPKSPESCS 

S VAESNGVLTEGEESDVESHGNGLEPGEI PAVPSGERNS FKVPS 

lAEGENKTSKSWRHPLSRPPARSPMTLVKQQAPSDEELPBVLSl 

EEEVEETESWAKPIilHLWQTKPPNFAABQBYNATVARMKPHCAI 

CTLLMPYHKPDSSNEENDARWETKLDEWTSEGKTKPLIPEMCF 

lYSEEWIBYSPPNAFIiEEDGTSLLISCAKCCVRVHASCYGIPSH 

E ICDGWLCARCKRNAWTAECCLCNLRGGALKQTKNNKWAHVMCA 

VAVPBVRFTNVPERTQIDVGRIPLQRIiKLKCIFCRHRVKRVSGA 

ciqcsygrcpasfhvtcahaagvlNmepddwpywnitcfrhkv 
npnvkskacekvisvgqtvitkhrntryyscrvmavtsqtpyev 

MFDDGSFSRDTPPRnTU'^T?nr'T.lfT/3DDai?r«l?tn7'r\TrvMr>T^**»-oT vi^ 
AKYFGSNIAHMYQVEFEDGSQIAMKREDIYTLDEELPKRVKARF 

VSAGRCHLGTCQVNSLSSPHVSQAQQETYLGFWINSKKSQCNIF 
LSGTY 


6057 


1 


853 


FVAR1.KEQEGEGGLGPRKEKGRARGRERRRKMQLTRCCFVFLVQ 
GSIiYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKS 
RPMANSTIiLGIiIAPPGEAWG I LGQPPNRPNHS PPPSAKVKKI PG 

WGDFYSNIKTVALNLLVTGKIVDHGNGTFSVHFQHNATGQGNIS 
I SIjVPPS KAVE FHQEQ 0 IFI EAKASKI PNC \ RME WE ICVP \ 15 Pt>tj 

TSLPTHDPAKICSRDHAQSSATWSCSQPFKWCVYIAPYSTDYR 
LVQKVCPDYNYHSDTPYYPSG 


6058 


1 


986 


HPLPSASLGLPSVSLGVSLCVRSALLEAVVPMLPKRRRARVGSF" 

SGDAASSTPPSTRFPGVAIYLVEPRMGRSRRAFLTGIARSKGPR 

VliDACSSEATHWMEETSAEEAVSWQERRMAAAPPGCTPPALLD 

ISWLTESLGAGQPVPVECRHRLEVAGPSKGPLSPAWMPAYACQR 

PTPLTHHNTGLSBALE ILAEAAGFEGSBGRLr^TFCRAASVLKAL 

PSPVTTLSQLQGLPHFGEHSSRWQEIiIjEHGVCEEVERVRRSB/ 

RLFTQIFGVGVKTADRWYRBGLRTLDDLRBQPQKLTQQQKAGEP 

SREAGPWASLNCTLDPSASTP 


605S 


2 


3650 


QQDFSSLADLTDHRAHRCPGDGDDDPQriSWVASS PSS KDVASPT 
QMIGDGCDHSLGEBEGGTGLPYPCQFCDKSFIRLSYLKRHEQIH 
SDKLPFKCTYCSRLPKHKRSRDRHIiaHTGDKKYHCHECEAAFS 
RSPHLKIHLKTHSSSKPPKCTVCKRGFSSTSSI^QSHMQAHKKNK 
EHIAKSEKEAKKDDFMCDYCEDTFSQTEELEKHVLTRHPQIiSEK 
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SEQ " 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
correspond ing 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide" 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, lalsoleucine, K=Lysine; 
LaLeucine, M=Methionine, N=Asparagine, 
P^Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, y=Tyi:osine, X=*tInknown, *^Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ADLQCIHCPEVFVDENTIiLAHIHQAHANQKHKCPMCPEVQFSSV 
\EGVYCHLDSHRQPDSSNHSVSPDPVLGSVASMgSATPDSSASV 
ERGSTPDSTLKPLRGQKKMRDDGQGWTKWYSCPYCSKRDFNSL 
AVLEIHLKTXHADKPQQSHTCQICLDSMPTLYNLNEHVRKIiHKN 
HAYPVMQFGNISAFHCNYCPEMFADINSLQEHIRVSHCGPNANP 
SDGNNA?FC3JQCSMGFLTESSLTEHIQ\Q\AHCSVGSAKLESPV 
VQPTQSFMEVYSCPYCTNSPI FGS ILKLTKHIKENHKNIPLAHS 
KKSKAEQSPVSSDVEVSSPKRQRLSASANSISNGEYPCNQCDLK 
FSNFESFQTHLXLHLELLLRKQACPQCKEDFDSQESLLQHLTVH 
YMTTSrHYVCESCDKQFSSVDD\LQKH\LLDMPHPLCCTHCT\L 
CQE VPDS\KVS I \QVHIiAVKHSNEKKMYRCTACNWDFRKEADIiQ 
VEIVKHSHLGNPAKAHKCXPCGETFSTEVELQCHITTHSKKYNCK 
FCS KAFHAI ILLEKHLREKHCVFDAATEKGTANGVPPMATKKAE 
PADIiQGMLLKNPEAPNSHEASEDDVDASEPMYGCDICGAAYTME 
VLLQNHRLRDHNIRPGEDDGSRKKAEFIKGSHKCNVCSRTFFSE 
NGLREHLQTHRGPAKHYMCPICGERPPSLLTLTEHKVTHSKSLD 
IGTCRI CKMPLQSEEE PI EHCQMHPDLRNS LTGPRCWCMQTVT 
STLELKIHGTFHMQKLAGSSAASSPNGQGLQKLYKCALCLKBFR 
SKQDLVKLDVNGIiPYGLCAGCMARSANGQVGGLAPPEPADRPCA 
GLRCPECSVKFESAEDLBSHMQVDHRDLTPETSGPRKGTQTSPV 
PRKKTYQCIKCQMTFENEREIQIHVANHMIEEGINHECKLCNQM 
FDSPAKLLCHLIEHSFBGMGGTFKCPVCFTVFVQANKLQQHIFA 
VHGQEDKIYDCSQCPQKFFFQTELQNHTMSQHAQ 


6060 


2145 


202 


SYEIVGKNKLEVNHSQLKALCKCSIiPSRLLPLGENLPXiLDRGFR 
KEPRSRGSRERDNMLHLHHSCLCFRSWLPAMLAVLLSLAPSASS 
DISASRPWILLIiMADDLGIGDIGCYGWNTMRTPWIDRLABDGVK 
LTQKISAASIiCTPSRAAFLTGRYPVRSGMVSSIGYRVLQWTGAS 
GGLPTNETTFAK ILEBKG YATGLIGKWHIjGIiNCESASDHCHH P L 
HHGFDHF YGMP FSLMGDCARWELSB KRVNLiEQKLNFLFQVIiALV 
ALTLVAGKLTHIiIPVSWMPVIWSAIiSAVLLJLASSYPVGALIVHA 
DCFLMRNHTITEQPMCFQRTTPLILQEVASFLKRMKHGPFLLPV 
S FLHVHI PLITMENFLGKSLHGLYGDNVKEMDWMVGRILDTLDV 
EGLSNSTLIYFTSDHGGSLENQLGNTQYGGWNGIYKGGKGMGGW 
EGGIRVPGIFRWPGVIiPAGRVIGEPTSLMDVFPTWRlAGSEVP 
QDRVIDGQDLLPLLLGTAQHSDHEFLMHYCERFLHAARWHQRDR 
GTMWKVHFVTPVFQPEGAGACYGRKVCPCFGEKWHHDPPLLFD 
LSRDPSETHILTPASEPVFYQVMERNVQQAVWEHQRTIiSPVPLQ 
LDRLGN I WRPWLQPCCGP P P IjCWCLREDD PQ 


6061 


110 


133 0 


MNIHMKRKTIK^! INTFENRMI*MIJ3GMPAVRVKTELLESEQGS PN "■ 

VHNYPDMBAVPLLI.NNVKGEPPEDSLSVDHFQTQTEPVDLSINK 

ARTSPTAVSSSPVSMTASASSPSSTSTSSSSSSRIiASSPTVITS 

VSSASSSSTVLTPGPLVASASGVGGQQFLH I IHPVPPSSPMNLQ 

SNKLSHVHRIPVWQSVPWYTAVRSPGNVNNTIWPLLEDGRG 

HGKRQMDPRGLSPRQSKSDSDDDDLPNVTLDSVNETGSTALSIA 

RAVQEVHPS PVSRVRGNRMNWQKFPCS I SPPSIESTRRQRTVLM 

KCTWEGCTWKFARSDELTRHYRKHTGVKPPKCADCDRSPSRSDH 
lALHRRRHMLV 


6062 


71 


1079 


ETMAKNGPENCBDCH I L^TAEAFKSKKICKSLKICGLVFG I LAIiT"" 
LI VLFWGS KHFWPEVPKKAYDMEHTFYSNGEKECKI YME IDFVTR 
TE I FRSGNGTDETLEVHDFKNGYTGI YFVGLQKCFIKTQl KVI P 
EFSEPEEEIDENEEITTTPFEQSVIWVPAEKPIENRDFLKNSKI 
LE I CDNVTMYW\ INPTL\ISGTFAKQLHHlSfPAFI ILVS ELQDPE 
EEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEBELP 
INDYTENGIEFDPMLDERGYCCIYCRRGNRYCRRVCEPLLGYYP 
YPYCYQGGRVICRVIMPCNWWVARMLGRV 


6063 


71 


1079 


ETMAKNGPENCEDCHILNAEAFKS KKI CKSLKICGLVFGILALT 
LIVLFWGSKHPWPEVPKKAYDMEHTFYSNGEKKKIYMEIDPVTR 
TE I FRSGNGTDBTLB VHDFKNG YTGIYF VGLQKCFIKTQI KVIP 
EPSEPEEEIDEWEEITTTPPEOSVIWVPAEKPIENRDFLKNSKI 
DEICDNVTMYW\ INPTL\ ISGTFAKQLHHNFAFIILVSELQDFE 
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ID 
NO: 



60S5 



Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino stcid 
residue of 
amino acid 
sequence 



913 



Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



1153 



6066 



68" 



311 



A-nino acid segment containing signal peptid"^ 
(A=Alanxne, C=Cyeteine, 0=Aspartic Acid, B« 
Glutamic Acid, P«Phenylalanine, G=Glycine, 
H=Histidiner I^Isoleucine, K-Lysine, 
L=I*eucine, M^Methionine, K=Asparagl:ie, 
P=Proline, Q=GluCamine, R=Arginane, 
S=Ser ine , T=Threonine , V=Val ine , 
W=Tryptophan, Y=Tyrosine, X«Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\=poseible nucleotid e insertion) 
EEGEDLHPPANEKKGIEQNEQWWPQVKVKKTHHARQASEEELP 



641 



3470 



* *• V vrv vi\.vjiJ\iKtt/\KyAaEBEliP 

INDYTTENGIEFDPMLDERGyCCIYCRRGNRYCRRVCKPLLGYYP 
YPYCYQGGRVICRVIMPCNWWVAR(V7LGHV 

KLPQSLPRPTEHSPPYSLEKMTDLVAV WDVALgDGVHKIEFBHG 
TTSGKRWYVDGKEE IRKEWMPKLVGKETFYVGAAKTKATINID 
AISGPAYEYTLEINGKSLKKYMEDRSKTTWTWVIiHMDGENFRIV 
LE KDAMDVWCNGKKLETAGEP VDDGTETHFS I GTHXACY I KAV\ 
SSQ\KRKEGIIHTLIVONREIPElAS 

msvrvarvawvrglgasyrrgassfpv pppgaqgvabllrdatg 

AEEEAPWAATERRMPGQCSVI,LFPGQC3SQWGMGRGLLNYPRVR 

elyaaarrvlgydllelslhgpqetldrtvhcqpaifvasiaav 

EtCLHHLQPSVIENCVAAAGFSVGBFAALVFAGAM BFAEG 

vkenmpatrkpmryghteghtevcfddsgsfivtossdgdvriw 

BDLDDDDPKFINVGEKAYSCALKSGKLVTAVSNNTIQVHTFPEG 
VPDGILTRFTTNANHWFN6DGTKIAAGSSD\FL.VKIVI>VMDSS 
QQKTFRGHDAP VtiSLS FDPKD I PLASASCDGSVRVWQ ISDQTCA 
ISWPIiLQKCNDVINAKSrCRIAWQPKSGKLLAIPVEKSVKLYRR 
ESWSHQFDLSDNFISQTIiWIVTWSPCGQyLAAGSIWGI.IIVWNV 
ETKDCMERVKHEKGYAICGLAWHPTCGRISYTDAEQNLGLLENV 
CDPSGXTSSSKVSSRVEKDYNDIiPDGDDMSNAGDFLNDNAVEIP 
SFSKGIINDDEDDEDLMMASGRPRQRSHILEDDENSVDISMIiKT 
GSSLliKEEEEDGQEGSIHNLPLVTSQRPFYDGPMPTPRQKPFQS 
GSTPLHLTHRFWVWNS IGI IRCYNDEQDNAIDVEFHDTS IHHAT 
HLSNTLNnrriADLSHEAILLACESTDEIASKLHCLHFSSWDSSK 
EWI IDLPQNBDIEAICLGQGSflAAAATSALLIiRLPTIGGVQKEVF 
SLAGPWSMAGHGEQLFIVYHRGTGPDGDQCLGVQLLSLGKKKK 
QILHGDPLPLTRKSYLAWIGFSAEGTPCYVDSEGIVRMLNRGLG 
NTWTPICNTREHCKGKSDHYWWGIHENPQQLRCIPCKGSRFPP 
rLPRPAVAILSFKIiPYCQIATEKGQMEEQFWRSVIFHNHLDYLA 
ICNGYEYEKSTKNQATKEQQELLMKMIiALSCKLEREFRCVELADL 
MTQNAVNIAIXYASRSRKLIIAQKLSELAVEKAAELTATQVEEE 
EEEBDFRKKLWAGYSNTATEWSQPRFRWQVEEDAEDSGEADDEB 
KPBIHKPGQNSFSKSTNSSDVSAKSGAVTFSSQGRVNPFKVSAS 
S KEPAMS MNSARSTNILDKMGKSSKKSTALSRTTNNEKSPI I KP 
LIPKPKPKQASAASYFQKRNSQTNKTEEVKEENLKNVLSETPAI 
CPPQNTENQRPKTGFQMWLEENRStll LSDNPDFSDEADI I KEGM 
IRFRVLSTEERKVWANKAKGETASEX5TEAKKRKRVVDESDETEW 
QBEKAKENLNLSKKQKPLDFSTNQKLSAFAFKQE 



321 



6068 



6069 



13 



173 0 



583 ' 



liPWQKi.GVljLSRGKMAVTGWIiESI^TAQKTALLQD6RRKVtl5xF" 

PDGKEMAEEYDEKTSELLVRKWRVKSALGAMGQWQLEVGDPAPL 

GAGNLGPELIKESNANPIPMRKDTKMSFQWRIRNLPYPKDVYSV 

SVDQKERCIIVRTTNKKYYKKFSrPDLDRHQLPLDDALLSFA\T 
PTAP 



27 



GSIO^ADIANEEKPAIAPPVFVFQKDKGQKSPAEQKNLSDSGEEP 
RGEAEAPHHGTGHPESAGEHALEPPAPAGASASTPPPPAPEAQL 
PPFPREIiAGRSAGGSSPEGGEDSDREDGNYCPPVKRERTSSLTQ 
FPPSQSEERSSGFRLKPPTLIHGQAPSAGLPSQKPKEQQRSVLR 
PAVLQAPQPKALSQTVPSSGTNGVSLPADCT6AVPAASPDTAAW 
RSPSEAADEVCALBEKEPQKNESSNASEEEACEKKDPATQQAPV 
FGQNLRDRVKLlNESVDEADMBNAGHPSADTPTATNYFIiQYISS 
SLEMSTNSADASSNKFVFGQNMSERVLSPPFOiNEVSSDANRENA 
AAESGSESSSQEATPEKESLABSAAAYTKATARKCLLEiCVEVIT 
GEEAESNVLQMQCKLFVFDKTSQSWVERGRGLLRLNDMASTDDG 
TLQSRLSDAGPRGSLR\Lir.KTKLWAQMQIDKASEK\SIRlTAM 
DNEDQGVKVFX.rSASSKDrGQVyAALHHRILALRSRVEQEQEAK 
M PAPE PGAAPSNEEDDSDDDDVLAPSGATAAGAGDBGDGQTTGS 
T 



PTRPGQAGSSSAMAAQRLGKRVLSKLQSPSRARGPGGSPGGLQK 
RHARVTVKYDRRELQRRLDVEKWIDGKLEELYRGMEADMPDEIN 
IDELLBIiESEEERSRKIQGLLKSCGKPVEDFIQELIAKLQGLHR 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 
location 
c £ e £^ ^ ohq jl 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepti<3e 
{A=Alaniiie, C=Cysteine, D=sAspartic Acid, Ew 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine , 
PoProline, Q=Glutamine, RaArginine, 
S=Serine, T=Threonine, V=Vallne, 
W^Tryptophan, Y^Tyrosine, X=Unknown, *=!SCop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








QNPGLRQPS PS? \DGQPSAPFQGPGARTASPLTIiIiALFPG P PER 
RPALLCVLSCI 


S070 


478 


856 


IRVTVDQEPIjHYXFPLQFIiDSPBW/RPTETHRGRHF^QVTLTAE 
TDCRYVSWRRKKLYLLFAQHRY I SRLFSVLIGSD I ADKL YATiND 
RVYIGKRYKYDIRLPNFYQMSTPEIRRSPIiTQHFQNSRHYW 


6071 


2 


1654 


HEARTKGNMALARP\VRLFSLVTRLLLAPRRGLTVRSPDE?LPV 
VRIPVALQRQLEQRQSRRRNLPRPVIiVRPGPLLVSARRPELNQP 
ARLTLGRWERAPLASQGWKSRRARRDHFS I ERAQQEAPATOKLS 
SKGS FADLGAWKPRVLHALQE \AAPE VVQ\ PTTVQSST I PS LLR 

grhwcaaetgsgktlsyllpllqrllgXhpsldslpipaprgl 
vlvpsrsiiaqqvravaqplgrslgllvrdlkgghgmrrirlqls 
rqpsadvlvatpgalwkalksrlisleqlsflvldeadtlldas 
fleitvdyilekshi/vegpadledpfnpkaqlvlvgatfpbgvgq 
llnkvaspdavttitssklhcimphvkqtflrlkgadkvaelvh 

ILKHRDRAERTGPSGTVLVFCNSSSTVNWLGYILDDHKIQHLRIj 
QGQMPALMRVGIFQSFQKSSRDILLCTDIASRGIjDSTGVELVVN 
YDFPPTLQDYIHRAGRVGRVGSEVPGTVISFVTHPWDVSLVQKI 
ELAARRRRSLPGLASSVKEPLPQAT 


6072 


1 


742 


kmertemmptinsqlefkskpfplvsssrwlvkrgeltayvedt 
vlfsrrtskqqvyfflfndvliitkkksbesynvndyslrdqll 

VESCDWEELNSSPGKNSSTMLYSRQSSASHLPTLTVLSNHANEK 

vemllgaetqserarwitalghssgkppadrtsltqveivrspt 

AKQPDELSLQ VADWIil \ YQRVSDGWYEGER \liRDGER6WPPME 
CAKEITCQATIDKMVERMGRLU3LETWV 


6073 


620 


B60 


PCRRGIARPLSRRPG/S ILVHCAVGVSRSATLVIAYLMLYHHLT " 
LVEAXKICVKDHRGI IPNkGKLRQLLALDRRLRQGLEA 


6074 


16B 


1110 


pgarcmatelqcpdsmpchnqqvnsastpspeqlrpgdlilldha 
ggnrasrakvilltgyahsslpaeldsgacggsslnsbgnsgsg 
dsssydapagnspledcelsrqigaqlkllpmndqi relqti i r 

DKTASRGDFMFSADRIiIRLWEEGLNQLPYKECMVTTPTGYKYE 
QVKPEKGNCGVSIMRSGEAMEQGLRDCCRSIRIGKILIQSDEET 
QRAKVYyAKFPPDIYRRKVr»LMYPIIiQTG\NTVIEAVKVLIEHG 
VQPSVI ILLSLFSTPHGAKS I IQEFPEIXr L'JTTEVHPVAPTHFG 
QKYFGTD 


6075 


320 


1091 


PPTOQPQEVEHHXYGYVPILGNKTLPSRCHQCVIVSSSSHLLGT 
KLGPE lERAECTIRMNDAPTTG YSADVGNKTTYRWAHSS VFRV 
LRRPQEFVNRTPBTVF IPWGPPS KMQKPQGSLVRVIQRAGLVFP 

nmeayavspgrmrqfddtiprgetgkdrekshswlstgwftmvia 
velcdhvhvygmvppnycsqrprlqrmpyhyyepkgpdecvtyi 
qnehsrkgnhhrfi tekrvfsswaqlyg itfshps wt 


6076 


1721 


107 


hpspteaprvqhltmdctwrilflvaaatgthaqv'Olvqsgaev 
kkpgasvkvsckvsgytltelsmh&jvrqapgkglewmgapdped 

GETIYAQKFQGRVTMTEDTSTDTAYMELSSLR3BDTAVYYCATD 
HGDYAFDI WGQGTMVTVSSAPTKAPDVPPI ISGCRHPKDNSP W 
riACLITGYHPTSV\TVTWYMGTQSQA\QRTFPEIQRRDSYYMTS 
SQLSTPLQQWRQGEYKCWQHTASKSKKEIFRWPESPKAQASSV 
PTAQPQAEGSLAKATTAPATTRNTGRGGEEKKKEKEKEEQEERE 
TKTPECPSHTQPIiGVYLLrPAVQDLWLRDKATFTCPWGSDLKD 
AHLTWEVAGKVPTGGVEEGLLERHSNGSQSQHSRLTLPRSLWNA 
GTSVTCTLNHPSLPPORLMAIiREPAAOAPVKLSrJJljtJLS <5DPPT? 

A\ASWLLCEVSGPSPPNIIjLMWLEDHSEVNTSQPAPARPL?KP\ 
RSTTFWA\WSVI,RVPAPPSPQPATYTCWSHEDSRTLLNASRSL 
BVSYVTDHGPMK 


6077 


3637 


1266 


LLPDMNLQP I FWIGLISS VCCVFAQTDENRCLKANAKS CGBCid 
AGPWCGWCTNSTFLQEGMPTSARCDDLEALKKKGCPPDDIENPR 
GSKDI KKNKNVTNRS KGtABKLKPEDITQIQPQQLVLRLRSGEP 
QTFTLKFKRAEDYPIDLYYIiM\DLSYSMKDDIiENVKSIiGTDLMN 
EMRRITSDFRIGFGS PVEKTVM PYI STTPAKLRNPCTS EQWCTS 
PFSYKNVLSLTKTKGEVFNELVGKQRISGNLDSPEGGFDAIMQVA 
VCGSI*IGWKirVTRLLVFSrDAGFHFAGDGKIiGGIVIiPIIIX5QCHL 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A^^Alanine, C^Cysteine, D=.Aspartic Acid, E= 
Glutamic Acid, P=Phenyl alanine, G^Glycine, 
HaHistidine, I=Isoleucine, K-Lysine, 
LsLeucine, M=Methionine, K^rAsparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine> T=Threonine, V=Valine, 
Wr^Tryptophan, Y^Tyrosine, X=Dnknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








EmTMrrMSHYyDYPSlAHLVQKLSEaNNIQTIFAVTEEFQPVYKE" 

LKNLIPKSAVQTLSAMSSNVIQLlIDAYNSIiSSEVILENGKLSE 

tiV risygsx \ CKNGVNGTGBNGRKCSNISIGDBVQFEISITSWK 

CPKXDSDSFKIRPLGFTEEVEVILQYICECECQSEGIPESPKCH 

EGNGrFECGACRCNEGRVGRHCECSTDEVNSEDIGCFTARKENQ 

FQKSASNHGRVPSAGQCVCRKRDIJTNE 1 YSGKFCECDNKNCDRS 

NGLICSGNGVCKCRVCECNPNYTGSACDCSLDTSTCEASNGQIC 

NGRGICECGVCKCTDPKFC2GQTCEMCQTCLGVCAEHKECVQCRA 

FNKGEKKDTCTQECSYFNITKVESRDKLPQPVQPDPVSHCKEKD 

VDDCWFYPTYSVNGNNEVMVHWEMPECPTGPDIIPIVAGWAG 

I VLIGIiALLLI WKLLM I IHDRREFAKFEKEKMNAKWDTGENP I Y 
KSAVTTWNPKYEGK 


6078 


1426 


180 


ETEDVMEL£,EEDLTCPICCSLFDDPRVLPCSHNFCKKCI^GILE 
GSVRNSLWRPVPFKCPTCRKKTFSYWELIPLQVNYSLKGIVEKY 
NKlKISPKMPVCKGH\LGQPLNIP\Cli\TDMQLDL/CGIC\ATR 
GEHTKHVFCSIEDAYAQERDAFESLFQSFETWRRGDALSRLOTL 
BTS KRKSLQIiLTKDSDKVKEFFEKLQHTLDQKKNEILSDFBTMK 
IiAVMQAYDPEINKLNTIIiQEQRMAPMIASAFKDVSBPIVFLQQM 
QEFREKIKVIKETPLPPSNLPASPLMKNPDTSQWEDIKLVDVDK 
LSLPQDTGTPISKlPWSFYKLFLLILLLGLVlVFGPTMFLEWSr. 
FDDLATWKGCI.SNFSSYLTKTADFISQSVFYWEQVTDGPFIFNE 
RFKWFTLWLNNVAEFVCKYKLL 


S079 


1586 


141 


ATARDi:.GCARRIDRVVMESTPSRGLNRVHIK>CRNLQEFLGGLSP 
GVLDRLYGHPATCLAVFREIiPSLAKNWVMRMLFLEQPLPQAAVA 
LVmCKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQ 
WLRIAIiUSGGKAWSDDTSQLGPDKHARDfVPSLDKYAEERWEWL 
HFMVGSPSAAVSQDIiAQI*r,SOAGIjyiKSrEPGEPPCITSAGFQFL 
LLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 
YSVEGMSDSLLNFLQHLREPGLVPQRKRKSRRYYPT/RALAINL 
SSGVSGAGGTVHQPGPIV\VETNYRLYAYTESEliQIAI.lALPSE 
MLYPFP\JJMW\ARVTR\ESVQQAIASGITAQQIIHFLRTRAHP 
V^a:.KQ^?VLPPTITDQII<l,WELERDRLRFTEGVLYNQFLSQVDF 

ellXlahapklgvlvfb/ntpakrlmwtpaghsdvkrfwkrqk 


6080 


1 


1199 


letidhvgefamaaqaagvsrqraatqalgsnqhalkylqqdfk 
tlrqqcldsgvlfkdpeppacpsalgyxdlgpgspqtqgiiwkr 

PTELCPSpQFIVGGATRTDrCQGGLGDCWLLAAIASLTI^EEIiL 

yrwprdqdfqenyagifhfqplcppspVfwqygewvewiddr 

LPTKNGQLLFLHSBQGNEFWSALLEKAYAKLNGCYEAIiAGGSTV 

egfedftggisefydlkkppanlyoiirkalcagsllgcsidvy 

SAAEAEAITSQKLVKSHAYSVrGVEEVNFQGHPEKLIRLRNPWG 

eveksgawsddapewnhidprrkeeldkkvedgefwmslsdpvr 
qfsrleicnlspdslsseevhkwnlvlfnghwtrgstaggcqny 

PGSS 


6081 
6082 


3 


865 


EMLPLIiIjPLPIiLWA/GAIiAQDARFRLEMPESVTVQEGLCIFVHC 

svfyleygwkdstpayghwprbgvsvdqetpvatnnstqkvqkb 

TQGRFHXiLGDPSRNNCSLSIRDARRRDWGSYFFWVARGRTKFSY 

kysplsvtvtalthrpdilipeflksghpsnltcsvpwvceqgt 
ppifswmsaaptslgprtlhssvltiiprpqdhgtnlicqvtfp 
gagvttertiqlsvswksgtveewvlavgwavkilllclcli 
I lsfhkkkavravevsenvyavmg 


6083 


283 
1865 


1288 

: 

] 
1 
} 
I 

309 I 


barspqptqtrtapqiaapglaqpaalrlllsrppsaamdgdgd 
pesvgqpeeaspeeqpeeasaeeerpbdqqeeeaaaaa\y\lde 

LPEPLI»A/LRVliAALPRHE\LVQACR\LVCt,RWKELVDGAPLWt> 

lkcqqeglvpeggveeerdhwqqpyplskrrrni*lrnpcgeedl 
sgwcdvehggdgwrveelpgdsgvepthdesvkkypassfewcr 
caqvidlqaegyweblldttqpaiwkdwysgrsdagcliysltv 
cllsehenvlaefssgqvavpqdsdgggwmeishtptoygpgvr 
^vrfehggqdsvywkgwpgarvtnssvwvep 

CQWCSVERRGLGMSlADEIiADLEEAAEEEEGGSYGEEBEEPAiE 
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10 

NO: 


Fredicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
. amino acid 
secfuence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
(A=Alanine, C-Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, Phenyl alanine, G-Glycine, 
H=Histidine, I^Isoleucine, K=Ly3ine, 
L=Leucine, M*- Methionine, N-Aaparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V-Valine, 
W^Tryptophan, Y=Tyro5ine, X=Unknown, *=stop 
Codon, /=poss±hle nucleotide deletion, 
\=possible nucleotide insertion) 


6084 






DVQEETQIJ3l>SGDSVKTIAKliWDSKMFAEIMMyTPPVTCvnBVA 
SEVMGPVEAAPEYRVIVDANNLTVEIENELNIIHKPIRDKYSKR 
FPELESLVPNALDyiRTVKELGNSLDKCKNNENLQQII^rNATlM 
WSVTASTTQGQQLSEEELERLEEACDMAJbELNASKHRIYEyVE 
SRMSPIAPNX.SIIIGASTAAKIMGVAGGLTNLSKMPAQrXMLLG 
AQRKTLSGFSSTSVLPHTGYIYHSDIVQSLPPIPPPPSVAPXDI. 
RRKAARLVAAKCTLAARVDSFHESTEGBCVGYELKBEIERKPDKW 

qepppvkqvkplpapldgqrkkrggrryrkmkerlglteirXko 

ANRMSFGErEEDAYQEDLGFSLGHLGKSOSGRVRQTQVWEATKA 

risktlqrtlqkqswyggkstirdrssgtassvaptplqglei 

_ VNPQAAEKKVAEAt?QKYFSSMAEFLKVKOEKSGLMST 


6035 


1865 


309 


KQWCAERRCilx^MSIiADELLADLEEAAEEKKGGSYGEEteEHPAIE " 
DVQE ETQLDLSGDS VKTIAKLWDSKMPAE T mm vt pttvt o vr^7^ vi, 

SEVMGPVEAAPEYRVIVDANKLTVEIENELKI IHKFIRDKYSXR 
PPELESLVPNALDYIRTVKELGNSLDKCKNNENLQQILTNATIM 
WSVTASTTQGQQLSEEELERLEEACDMALBLNASKHRIYEYVE 
SRMSFIAPNLSriIGASTAAKIMGVAGGLTNLSKMPACNIMLLG 
AQRKTLSGPSSTSVLPHTGYIYHSDIVQSLPPrPPPFSVAP\DL 
RRKAARLVAAKCTLAARVDSFHESTEGKVGYELKDEIERKFDKW 
QEPPPVKQVKPLPAPLDGQRKKRGGRRYRKMKERLGLTEIRXKO 
ANRMSFGEIEEDAYQEDLGFSLGHLGKSGSGRVRQTQVNEATKA 
RISKTLQRTLQKQSWYGGKSTIRDRSSGTASSVAFTPLQGLEI 
VNPQAAEKKVAEANQKYFSSMAEFLKVKGEKSGLMST 


6086 


2 


1456 


StJPRS FQGKRAVGklSI>GGKRNPEVTLLPGVSSERVRRWRRARV" 
GVARVKPGNPWKPSPATQVPR/VPAOWT.PfiprDDT Dtrrit^Br 

DEEAYVLYKRAQTGAPCLSFDIVRDHLGDNRTELPJCTDYLCAGT 

OAESAQSNRIrfOMLRMHNLHGTKPPPSEGSDEEEEEEDEEDEEER 

KPQLEliAMVPHYGGINRVRVSWLGEEPVAGVWSEKGQVEVPALR 

RLLQW2EPQALAAFLRDEQAQMKPIFSFAGHMGEGFALDWSPR 

VTGRLLTGDCQKNIHLWTPTDGGSWHVDQRPPVGHTRSVBDLQW 

SPTENTVFASCSABASIRIWDIRAAPSKACMLTTATA2TOGDVW 

ISWSRREPPLLSGGDDGALKIWDLRQFKSGSPVATFKQHVAPVT 

SVEWHPQDSGVPAASQADHQITQWDLG/XVBRDPEAGDVEAD^G 
liADLPQQLLFVHOQETEIiKET.MMMPrir'DrT T'^fe'r'TkT t^r^-n^^rr^Z.^ 


6087 


2419 


1357 


tJflAigHGGAMMl.X.PCNPHGNQLLYAGFNQDHGCFACGyiEKGFRy 

YNTDPLKEKEKQEFLEGGVGHVEMLPRCWYLALVGGGKKPKYPP 

NKVMIWDDUCKKTVIEIEFSTEVKAVKLRR\DKIWVLDSMIKV 

FTFTHNP\HQrjIVPE\TCYNPKGLCVLCPNSNNSLIAFPGTHTG 

HVQLVDLASTEKPPVDIPAHEGVLSCIALNLQGTRIATASEKGT 

LIRIPDTSSGHLIQELRRGSQAANIYCIMFNQDASliICVSSDHG 

TVHIPAAEDPKRNKQSSIASASFLPKYFSSKWSFSKFQVPSGSP 

CICAFGTEPNAVIAICADGSYYKFLFNPKGECIRDVYAQPLEMT 
DDKIi 


~T088 


476 


1877 

] 

; 

] 
1 


gwSQRTGIiPITIFSRSFPLLTGSDLCENMPCTCTWRNWRQWIRp"" 

LVAVIYLVSIWAVPLCVWELQKLEVGIHTKAWFIAGIFLLLTI 

PISLWVILQHLVHYTQPELQKPIIRILWMVPIYSLDSWIALKYP 

3IAIYVDTCRECYBAYVIYNFMGFLTNYLTNRYPNLVLILEAKD 

QQKHFPPLCCCPPWAMGEVLLFRCKLGVLQYTWRPFTTIVALI 

CELLGl YDEGNPSFSNAWTYLVI INNMSQLFAMYCLLLPYKVLK 

SELSPIQPVGKFLCVKLWFVSPWQAWIALLVKVGVISEKHTW 

-wy 1 vfiJiVAi LrbyuF 1 lCIEMPliAAIA\HHYTPSYKPYVQEAEE 

SSCFDSFLAMWOVSDIRDDISBQVRHVGRTVRGHPRKKLPPEDQ 

JQNBHTSLLSSSSQDAISIASSMPPSPMGHYQGHXSHTVTPQTTP 

['TAKrSDElLSDTIGBKKEPSDKSVDS 




1684 


689 ' c 
I 
K 
S 
Q 
1 T 


^ASGLVRLLQQGHRCLIiAPVAPKLVPPVKGVKKGFRAAPRPQKB 
.ERQRLLRCPPPPVRRSEKPNWDYHAEIQAFGHRLQENFSLDLL 
:TAFVNSCYIKgEEAICRQQLGlEKEAVLLNLKSNQEr.SEQGTSF 
QTCLTQFLEDEYPDMPTEGIKNLVDFLTGEEWCHVAliNLAVE 
LTLSEEPPVPPAVLQQTFFAVIGALt.QSSGPERTALFlRDFLI 
QMTGKELPBMWKIINPMGLLVBELKKRNVSAPESRLTRQSG\A 
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SEQ 
ID 
NO: 


Predicted " 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
\>*-Mj.anine, c^C/steme, D=Aspartic Acid, E= 
Glutamic Acid, F«Phenyl alanine, G^Glycine, 
H«Hiotidine, I«lQoleucine, K=Lysine, 
L=Leucine, M=Methionine^ NwAsparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S^Serine, T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6089 






PTALPLYFVGLYCDKKLIABGPGBTVLVAKKKAAKVALRKLYGF ' 
TENEU^PWNYSKPKETLRAEKSITAS 


6090 


3 


3054 


TRLGIPGSTISSRPRLCALAAEGHFLGHSWTGSRAGAH1^5APAVf~ 

PSRRLRDLPAGGMWRLRRAAVACEVCQSX.VKHSSGIKGStiPLQK 

LHLVSRSIYHSHHPTLKLQRPQLRTSFQQPSSLTNLPLRKLKFS 

PIKYGYQPRRNFWPARIiATRLLKLRYLILGSAVGGGYTAKKTPD 

QWKDMIPDLSEYKWIVPDXVWEIDEYIDFEKIRKALPSSEDLVK 

LAPDPDKIVESLSLLKDFFTSGSPEBTAFRATDRGSESDKHFRK 

VSDKBKIDQLQEELLHTQLKYQRILERLEKENKELRKLVLQKDD 

KQIPPIESLRKSLIDMYSEVLDVLSDYDASYNTQDHLPRVWVG 

DQSAGKTSVLEMIAQARIFPRGSGEMiyJTRSPVKVTLSEGPHHVA 

LFKDSSREFDIiTKEEDLAALRHEIELRMRKMVKEGCTVSPETIS 

LNVKGPGLQRMVLVDLPGVINTVTSGMAPDTKBTIFS ISKAYMQ 

DPNAIILCIQDGSVDAERSIVTDLVSQMDPHGRRTIFVIiTICVDI. 

AEKNVASPSRIQQIIEGKLFPMKALGYFAWTGKGNSSESIEAI 

REYEEEFFQNSKLLKTSMLKAHQVTTRNLSIAVSDCFWKMVRES 

VEQ0ADSFKATRFNLE7EWKNKYPRLRELDRNELFEKAKWSILD 

EVISLSQVTPKHWEEILQQSLWERVSTHVIENIYLPAAQTMNSG 

TFNTTVDIKLKQWTDKQLPNKAVEVAWETLQKEFSRPMTEPKGK 

EHDDIFDKLKEAVKEESIKRHKWNDPAEDSLRVIQHNALEDRSI 

SDKQQWnAAIYFMEEALQARLKDTENAIENMVGPDXWKKRWLYW 

KNRTQEQCVHNETKNELEKMLKCNEEHPAYLASDEITTVRKNLE 

SRGVEVDPSLIKDTWHQVYRRHPLKTAbNHCNLCRRGFYYYQRH 

FVDSELECNDWLFWRIQRMLAITANTLRQQLTNTEVRRLEKNV 

KEVLEDFABIXJEKKIKLLTGKRVQLAEDLKKVREIQEKLDAFIE 
AXiHQEK 


6091 


194 


1560 


PVFVPAPGAVIiEQAS/ASPPLATQTVVPLQHCKIPELPVQASin 
FELQtiFFCQLIALFVHYINI YKTVWWYPPSHPPSHTSLNFHIiID 
FNLLMVTTIVLGRRFIGSIVKEASQRGKVSLPRS r LUPLTRFW 
LTATGWSLCRSLtHLFRTYSFLWLL/FPLLSVWDVHSVPAAELR 
P\RKTSLFNHMASMGPRBAVSGtAKSRDYLLTr,R\RRGSSTQDS 
CMARTPCP/PHACCLSPSLIRSEVBPLKMDFNWRMKEVLVSSML 
SAYYVAPVPVWFVKNTHYYDKRWSCELFLLVSI STS VILMQHLL 
PASYCDLLHKAAAHLGCWQKVDPALCSNVLQHPWTESCWWPQGV 
LVKKSKNVYKAVGHYNVAIPSDVSHPRFHFFFSKPLRrLNILLL 

LEGAVIVYQLYSLMSSEKWHQTISLALILPSNYYAFFKLLRDRL 
Vt>GKAYSYSASPQRDLDHRFS 






3279 


412 

] 
i 
< 
2 
1 
S 
c 


l>SRTREMEERli;ix,RRQiRLLQGLIDDyKTLHGNAPAPGTEAASG~ 

WQPPTYHSGRAFSARYPRPSRRGYSSHHGPSWRKKYSLVNRPPG 

PSDFPADHAVRPLHGARGGQPPVPQQHVLERQVOLSQGQNWIK 

VKPPSKSGSASASGAQRGSLEEFEDTPWSDQRPREGEGEPPRGQ 

LQPSRPTRARGTCSVSDPLLVCQKEPGKPRMVKSVGSVGDSPRE 

PRRTVSESVIAVKASPPSSALPPRTGVALGRKLGSHSVASCAPQ 

MiGDRRVDAGHTDQPVPSGSVGGPARPASGPRQAREASLWTCR 

TMKFRKNNYKWVAASSKSPRVARRALSPRVAAENVCKASAGMAN 

KVEKPQLIADPEPKPRKPATSSKPG3APSKYKWKASSP3ASSSS 

SFRWQSEAGSKDHASQLSPVLSRSPSG0\RPAVGHSGLKPLSGE 

TPLSAYKVKSRTKIIRRRGSTSLPGDKKSGTSPAATAKSHIiSLR 

RRQALRGKS S PVliKKTPNKGLVQ VTTHRLCRLPPSRAHLPTKEA 

SSLHAVRTAPTSKVIKTHYRIVKKTPASPIiSAPPFPLSLPSWRA 

RRLSLSRSLVLNRLRPVASGGGKAQPGSPWWRSKGYRCIGGVLY 

KVSANKLSKTSGQPSDAGSRPLLRTGRLDPAGSCSRSLASRAVQ 

RSIAIIRQARQRREKRK3YCMYYNRFGRCNRaERCPYIHDPEKV 

WCTRFVRGTCKKTDGTCPFSHHVSKEKMPVCSYFLKGICSNSN 

:pyshvyvsrkaevcsdflkgycplgakckkkhtij:,cpdfarrg 
vcprgaqcqllhrtqkrhsrraatspapgpsdatarsrvsashg 
'rxpsasqrptrql'pssaaltaaavaapphcpggsaspsssea^s 

JSSSSSSSPPASLDHEAPSLQEAAIAAACSWRLCKLPSFISLQS 
; PSPGAQPRVRAPRAPLTKDSGKPLHIKPRL 




143 


3190 ■ 7 
£ 


iKAPPTGESSEPKAKVLHTKRLYRAWEAVHRLDLlLCNKTAYQ 
VFKPBNISLRNKriRELCVKrWFLHPVDYGRKAEErjQWRKVYYE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E^* 
Glutamic Acid^ F= Phenyl alanine, G=Glycine, 
H=Histidine, I-Isoleucine, K=Lysine, 
L«Leucine, M=Methionine, N«Asparagine, 
P=Proline, Q=G1 ut amine ^ R=sArginine, 
SaSerine, T^Threonine, V~ Valine, 
W«Tryptophan, Y?=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=*possible nucleotide insertion) 








VIQIilKTNKKHIHSRSTLECAyRXHLVAGIGFYQHLIiLyiCSHY 
QI^BLQCCIDWTHVTDPLIGCKKPVSASGKEMDWAQMACHRCLVY 
LGDLSRYQNELAGVDTELIAERFYYQALSVAPQIGMPFNQLGTL 
AGSKYYNVEAMYCYLRCIQSEVSFEGAYGMLKRIiYDKAAKMYHQ 
LKKCETRKLSPGKKRCKDIKRLLVNPMYLQSIiLQPKSSSVDSEL 
TSLCQSVLEDFKLCLFYLPSSPNLSLASEDEEEYESGYAFLPDL 
LIFQMVIICLMCVHSLERAGSKQYSAAIAFXriALFSHLVNHVNI 
RIjQAELEEGENPVPAFQSDGTDEPESKEPVEKEEEPDPEPPPVT 
PQVGEGRKSRKPSRLSCliRRRRHPPKVGDDSDLSEGFESDSSHD 

sarasegsdsgsdkslegggtafdaetdsemnsqesrsdledme 
eeegtrsptlepprgrseapdslngplgpseasiasnlqamstq 
mfqtkrcpriiaptfsnliilqpttnphtsashrpcvngdvdkpse 
paseegsesegsessgrscrnersiqbklqviimaegiilpavkvf 

LDWLRTNPDLIIVCAQSSQSLWNRLSVLLNLLPAAGELQESGIiA 

lcpevqdliiegcelpdlpsslllpedmalrklpplraahrrfnf 
dtdrpllstleeswriccirsfghfiarlqgsilqfntpevgif 
vs i aqs eqesllqqaqaqfrmaqeearrnrlmrdmaqiiriiqlev 
sqlegslqqpkaqsamspylvpdtqaiichhlpvrrqiiatsgrfi 

VI I PRTVIDGIiDLLKKBHPGARDGIRYLEAEFKKGNRYIRCQKE 
VGKSFERHKLKRQDADAWTriYKILDSCKQLT\LAQGAGEBDPSG 
MVTIITGIiPLDMPSLIiSGPMQAALQAAAHASVDIKNVLDFYKQW 
KEIG 


6093 


76 


1002 


ACGRRAMXiALRVART/SRWGALXRGAVWAPGTRPSKRRACWALL 
PPVPCCLGCIiAERWRLRPAAIiGIiRLPGIGQRNHCSGAGKAAPRX 
PAAGAGAAAEAPGGQWGPASTPSLYENPWTIPNMI>SMTRIGI*AP 
VliGYLI lEEDFNIAtiGVFAIAGLTDLLDGFIARHrWAWQRSALGS 
ALDPLADKILISILYVSLTYADLIPVPLTYMIISRDVMIiIAAVP 
YVRYRTLPTPRTLAKYPNPC YATARLKPTFI SSCVNTAVQLI LVA 
ASLAAPVFTTYADSIYIjQIbWCFTAFTTAASAYSYYHYGRKTVQV 
IKD 


6094 


23 


1010 


PFLRCLRGDQKAKMSERKVLNKYYPPDFDPSKIPKLKIiPKDRQY 
WRIiMAPFNMRCKTCGEYIYKGKKFNARKETVQNEVYIjGIiPIFR 
FyiKCTRCLAEITFKTDPENTDyTMEHGATRNFQAEKI.I,EEEEK 
RVQKEREDEELNNPMKVLENRTKDS KLEMEVLENLQELKDIiNQR 
QAHVDFEAMLRQHRIjSEEERRRQQQEEDEQETAALLEEARKRRL 
LEDSDS EDEAAPS PLQPALRPNPTAILDEAPKPKRKVBVWEQS V 

gslgsrpplsrlvwkkakadpdcsngqpqa/aphprspaeqeg 
gqpytpdawrvlpeptgcipgq 


6095 


a 


1599 


trgraaersrgrghgplgggfa\svtoyfpsedfyrcgycknes 
gsrsngmwahsmtvqdyqdlidrgwrrsgkyvykpvmnqtccpq 
ytircrplqfqpskshkkvlkkmlkfiakgevpkgsce\depmd 

STMDDAVAGDFAL INKLDIQCDIiKTLSDDI keslesegknskke 

epqellqsqdpvgeklgsgepshs 



TRADCXTS: M I6257.U%CSH01 ! .DOC) 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
CO first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding , 
to first 
amino acid 
residue of 
amino acid 
sequence 


AmLno acid segment containing signal peptide " 
(A^Alanine, C=Cysteine, D=:Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G«Glycine, 
H=Histidine, I=Isoleucine, K==Lysine, 
L=Leucine, M=Methionine, N-Asparagine , 
PsProline, Q*=Glut amine, RwArginine, 
S«Serine, T«Threonine, V=Valine, 
W«Tryptophan, Y«Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








VKVHTVPKPGKGADLSKPPCRKAKEIRKERECRLKLMQQNPAGEL 
EGFQAQGHPPSLFPPKAKSNQPKSLEDLIFESLPENASHKLEVR 
WRSSPPSSQFKATLLESYQVYKRYQMVIHKNPPDTPTESQFTR 
FLCSSPLEAETPPNGPDCGYGSPHQQYWLDGKIIAVGVIDIIiPN 
CVSSVYLYlfDPDYSFLSLGVySALREIAFTRQLHEKTSQLSyyY 
MGFYIHSCPKMKYKGQYRPSDLLCPETYVWVPIEQCLPSLENSK 
YCRFNQDPEAVDEDRSTEPDRLQVFHKRAIMPYGVYKKQQKDPS 
EEAAVLQYASLVGQKCSERMLLFRN 


6096 


2277 


575 


QR VRAALLS SAMEDSEALGFEHMGLDPRL LQ AVTDLG WSRPTL I 
QBKAI PLALEGKDLLARARTGSGKTAAYAI PMLQLLLHRKATGP 
WEQAVRGLVLVPTKBLARQAQSMIQQLATYCARDVRVANVSAA 
EDSVSQRAVLMEKPDWVGTPSRILSHLQQDSLKLRDSLELLW 
DEADLIiFSFGFEEELKSLLCHIiPRIYQAFLMSATFNEDVQALKB 
LIIiHNPVTIjKLQESQLPGPDQLQQFQWCETEEDKFLLLYAtiLK 
LSL1RGECSLLFVNTLERSYRL.RLFLEQFSIPTCVLNGELPLRSR 
CHIISQFNQGPYDCVIATDAEVLGAPVKGKRRGRGPKGDKASDP 
EAGVARGIDFHHVSAVLNFDLPPTPEAYIHRAGRTARANNPGIV 
LTFVLPTEQFHLGKIEELLSGENRGPIIJiPYQPRMEEIEGPRYR 
CRDAMRSVTKQAIREARLJCEIKEBLLHSBKLKTYFEDNPRXdIjQ 
LLRHDLPIaHPAWKPHLQHVPDYLVPPAIiRGLVRPHKKXGRSCL 
PliVGRPREQS PRTHCAAS S TKBRKS DP QPSPPE WG PLWS 


6097 


1673 


192 


APGTMSGGKKKSSFQITSVTTDYEGPGSPGASDPPTPQPPTGPP 
PRLPNGEPSPBPGGKGTPRNGSPPPGAPSSRFRWKLPHGLGEP 
YRRGRWTCVDVYERDLEPHSFGGLLBGIRGASGGAGGRSLDSRL 
ELASLGLGAPTPP5GLSQGPTSWLRPPPTSPGPQARSFTGGLGQ 
LWPSKAKAEKPPI^SASSPQQRPPEPETCSSAGTSRAATPLPSL 
RVEABAGGSGARTPPLSRRKAVE)MRIiRMELGAPBEMGQVPPLDS 

DSDDDSGSGSIjVGIDNKIEQAMDLVKSHLMFAVREEVEVLKEQI 
RELABRWAALEQEWrGLLRALA\SPEQIjGSAGPPRGVPR\LGPPA 
PNGPFVLSLPSLTIVPLGLPGIiASAAWPPLPMPALIVPVFPGVG 
VQALSNGPWSPGPLPHLLIIPSUXSGGEGFRTGRQQGAPFGEET 
QPPPSLPGTPQQ 




6098 


168 


1074 


nyclrhrsplekdsspgssstsllikkqretsdtpimralkeld 

rlnsasqrssslppsnrksstptkreimltpvtvayspecrspkb 
nlspgfshllsknesspirfdillddldtvpvstlqrtnprkql 
\qfiiplddseek\tysekat\rnivnhsscpbpvpngvkkvsvr 
taweknksvsyeqckpvsvtpqgndfbytakirtiaeterff\d 
eltbcekdqieaalsrmpspggritlqtrlnqeafgrsfgkd 




6099 


168 


1074 


egkifknwgtqtekedtsninprqtetsvnasrspekcaqqrqk 
rl^rsasqrssslppsnrksstptkrei^^ltpvtvayspkrspke 

NLS PGFSHIjLSKNESS pi RFD ILLDDLDTVPVSTLQRTJJPRKQI. 

\qflplddseek\tysekat\dnivnhsscpepvpngvkkvsvr 
taneknksvsyeqckpvsvtpqgwdfeytakirtiaeterfpXd 
eltkekdqleaalsrmpspggritlqtrlnqeafgrspgecd 




6100 


2 


713 


FVE V3GYRSRADPBPRGRDTMTYAYLFKYI I IGDTGVGKSCLLL 
QFTDKRPQPVHDLTIGVEFGARMVNIDGKQIKLQIWDTAGQESF 
RSITRSYYRGAAGAIiLVYDITRHETPNHLTSWLEDARQHSSSNM 
VIMLIGKKSDLESRRDVKREEGEAFARE\HGLrFMETSAKTACN 
VEEAFINTA3CE I YRKIQQGLPDVHNEANG I KIGPQQS ISTSVGP 
5ASQRNSRDIGSNSGCC 




6101 


1 


1399 


FRGRAWPLREVSHWLGCRRVCSWSASWGRLPALSARLSPlitAFR 
GKMVFPLSCAVQQYAWGKKGSNSEVARLIiASSDPLAQIAEDKPY 
AELWMGTHPRGDAKILDNRISQKTLSQWIAENQDSLGSKVKDTF [ 
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SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D^=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
HsHistidine, I=Isoleucine, K^Lysxzie, 
L^Ijeucine^ M=:Methionine, N=Asparagine, 
PsProline, Q=Glutamine, R«Arginine, 
S=rSerine, T=«Threonine , V^sValine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *!^Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








NGNLPFL FKVLS VETPIiS IQAH PNKELAEKLHLQAPQHYPDANH 
KPEMAIALTPPQGLCGFRPVEEIVTFLKICVPEFOFLIGDEAATH 
LXQTMSHDSQAVASSLQSCPSHLMKSEKKVVVBQLlffLLVKRISQ 
QAAAGNNMEDIPGELLLQLHQQYPGDIGCFAIYPliNXiLTLKPGE 
AMFLEANVPHAyLKGDCVECMACSDKTVRAGLTPKFIDVPTLCE 

mlsytpssskdrlflptrsqedpylsiydppvpdftimkaNevp 

GXSVTEYKDLALDSAS IIiDKVQGTVIASTPTTQTPI PLQRGGVL 
F IGANESVSLKLTEPKDLLI FRACCLL 


6102 


70 


2415 


QTPQATLAANGAEDSRGGEMLPAG3IGASPAAPCCSESGDERKN 
LEEKSDINVTVIjIGS KQVSEGTDNGDLPS YVSAF IE KEVGNDIiK 
S LKKLDKL I EQRTVS KMQLEEQVLTISSE I PKRIRSALKNAEES 
KQPliNQFLBQETHLFSAINSHLLTAQPWMDDLGTMISQIEEIER 
HLAYLKMISQXEEtiSDNIQQYLMTNNVPBAASTLVSMAEIjDIKIi 
QESSCTHLLGFMRATVKFWHKIIiKDICLTSDFEEILAQLHWPFIA 
PPQSQTVGLSRPASAPEIYSYLETLFCQLLKLQTSHELIjTEPKX 
HSQKNTLFLPPLLSS/WPIQVMLTPLQKRFRYHFRGNRQTNVLS 

kpewylaqvlmwignhtefldekiqpildkvgslvnarlbfsrg 
lmmlvleklatdipcllyddnlfchlvdeviiliferelhsvhgyp 
gtfascmhilseetcfqrwltverkpalqkmdsmlsseaawvsq 
ykditdvdemkvpdcaetfmtlllvitdryknlptasrklqfle 

LQKDLVDDFRIRLTQVMKEBTRASLGFRYCAILNAVTfYrSTVLA 
DWADWFFLQLQQAALEVFAENNTLSKLQLGQLASMESSVFDDM 
INLLERLKHDMLTRQVDHVFREVKDAAKLYKKERWLSLPSQSEQ 
AVMSIiSSSACPLLLTLRDHtiLQLEQQLCFSLFKIPWQMLVEKLD 
VYIYQEIILANHFNEGGAAQL,QFDMTRNLFPLFSHYCKRPENYF 
KHIKEACIVLNLNVGSALTAGKDVLPVQLQGSFPAT 


6103 


207 


2523 


ESNSTMTTYIjEFIQQNEERDGVRFSWNVWPSSRIiEATRMVVPVA 
ALFTPLKERPDLPPIQYEPVLCSRTTCRAVLNPIiCQVDYRAKLW 
ACNFCYQEJNQFPPS YAGI S ELNQPAE LLPQFS S lEY WLRGPQM 
PLI FLYWDTCMEDEDLQAIiKESMQMS LSIiLPPTALVGL I TFGR 
MVQVHELGCEG I SKS YVFRGTKDLSAKQLQEMLGLSKVPVTQAT 
RGPQVQQPPPSNRFLQPVQKIDMNLTDLLGELQRDPWPVPQGKR 
PLRSSGVALSIAVGLLECTFPNTGARIMMFIGGPATQGPGMWG 
DELKTP IRSWHDIDKDNAKYVKKGTKHFEALANRAATTGHVID I 
YACALDQTGLIiEMKCCPNLTGGYMVMGDS FNTSIiFKQTFQRVFT 
KDMHGQPKMGFGGTLEIKTPR\EIKISGAIGPCVSLNSKGPCVS 
ENEIGTGGTCQWKICGLSPTTTLAIYFSVVNQHWAPIPQGG\RG 
A\IQFVTQY\QHSSGQRRIRVTTIARN\WADAQTQIQNIAASFD 
QEAAAILMARLAIYRAETEEGPDVLRWLDRQLIRLCQKFGEYHK 
DDPSS FRFSETFSIiYPQFMFHLRRSSFJjQVFNWS PDESS YYRHH 
FMRQDLTQSLIMIQPILYAYSFSGPPEPVLLDSSSILADRILLM 
DTFPQILIYHGETIAQWRKSGYQDMPEYENFRHLLQAPVDDAQE 
IIiHSRFPMPRYIDTEHGGSQARFLLSKVNPSQTHNNMYAWGQES 


6104 


124 


732 


KVSEYIILSKDKILFHALAMIiVLVVSPWSAARGVLRNYWERLLR 
KLPQSRPGFPS PPWGPALAVQVAQPCLQSQQMI P VEVKRI /RSL 
LDSI FWMAAPKNRRTIEVNRCRRRKPQKLIKVKNNIDVCPECGH 
LKQKHVLCAYCYEKVCKETAErRRQIGKQEGQPFKAPTIETWL 
YTGETPSEQDQGKRIIERDRKRPSWFTQN 


6105 


3 


983 


PLHGACTSLVI>QRFCHRRPRPCAPARPEDMRRPAAVPLLIiLLCF" 

gsqrakaatacgrprmlnrptvggqdtqegewpwqvsiqrngshf 
cggsliaeqwvltaahcprntsetslyqvllgarqlvqpgpham 
yarvrqvesnplyqgtassadvalveleapvpftnyilpvcxpd 
psvifbtgmncwvtgwgspseedllpeprilqklavpixdt\pr 

CNLLYSKDTEFGYQPKTIKNDMLCAGFEEGKKDACKGDSAQPLV 
CIjVGQSWLQAGVISWQEGCAEQNRPGVYIRVTAHHNWIHRI I PK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, Cs=Cysteine, D=Aspartic Acid, E= 
Glutamics Acid, F=Phenylalanine, GsGlycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L«l*eucine, M=Methionine, N^Asparagine^ 
P=Proline , Q=Glut amine , R^Arginine , 
S=Serine , T=Threonine , V=Valine , 
WsTryptophan, Ys=Tyrosine, X=Unknown, *«*Stop 
Codon, /^possible nucleotide deletion, 
\=poesible nucleotide insertion) 








LQVQPSEVGRPEVTPPGPGAP 


6106 


3 


1302 


GRPPTAPHTGRPPTANRGDPRLDLKRGCARLLTSIESRGRPAAS 
AGLRRDRCALRRWPLRRAPLARATRRRAGSPRRCAPRPRACPQG 
WSRARHQPGGLCLLIiLIiLCQFMEDRSAQAGNCWLRQAKNGRCQV 
1 LYKTEl.SKEECCSTGRLSTSWTEEDVMDin'LFKWMI FNGGAPNC 
IPCKETCEN\'DCGPGKKCRMNKKNKPRCVCAPDCSNITWKGPVC 
GLDGKTYRNECAIiLKARCKEQPELEVQYQGRCKKTCRDVFCPGS 
STCV\VDQTNNAYCrVTCNRICPEPASSEQyLCGNDGVTYS\SAC 
HLRKATCLLGRS IGLAYEGKCI KAKSCEDIQCTGGKKCLWDPiCV 
GRGRCSLCDELCPDSKSDEPVCASDNATYASECAMKBAACSSGV 
LLEVXHSGSCNS r SEDTEEEEEDEDQDYSPPl S SILEW 


6107 


523 


1^8 


SRCSS PRPEPGRGRGK/ LSPSEHRKWVE VFKACDEDHKGYLSRE 
DF KTAVVMLFG YKPSKI E VDS VMS S INPNT SG I LIiEGP^ 
KKBAQRYRKEVRH 1 FTAFDTYYRGPLTLEDFKKAFRQVAPKLPE 
RTVLBVFREVN DRDS\DGHVSF 


61Q8 


3 


1346 


GGSLRFSPPRVPS.CSRVFCPVPPGGCGLPSPMSASRPQSPTTPW 1 
CLPRRYMKHKRDDGPEKQEDEAVDVTPVMTCVFWMCCSMLVLL 1 
YYFYDLIiVYWIGrFCLASATGLYSCLAPCVRRLPXSASAGESA : 
LIAPTI PNNSL PYPHKRPQARMLUtALFCVAVS WWGVFRNEDQ 
WAWVLQDALGIAFCIiYMLKTIRLPTFKACTLLLLVIjFLYDIFFV 
FITP FLTKSGSS I MVEVATGPSDSATRBKLPMVLKVPRLNSS PO 
ALCDRPFSLXiGFGDILVPGLLVAYCEiRFDIQVQSSRVYFVACTI 
AYGVGLLVTFVALALMQRGQPALLYLVPCTLVTSCAVALWRRBb 

PATS PWPAEQSPKSRTSEEMGAGAPMREPGS PAES EGRDQAQPS 
PVTQPGASA 


6109 


1 


1381 


CRSRAGAASGGAILEGTKLRRQRVDTNKPLDPLVPSALRAAMLY 
LEDYLEMIEQLPMDLRDRFTEMREMDLQVQNAMDQLEQRVSEFF 
MNAKKNKPEWREEQMASIKKDYYKALEDADEKVQIiANQIYDIiVD 
RHLRKLDQELAKFKMBIiBADNAGITEILERRSLEtiDTPSQPVim 
HHAHSHTPVEKRKYNPTSHHTTTDHIPEKKFKSEALLSTLTSDA 
SKEHTLGCRNNNSTASSNNAYNVNSSQPLGSYHIGSLSSGTGAG 
GI \TMAAAQAVQATAQMKEGRRTSSLKASYEAPKNMDFQLGKEF 
SMARETVGYSSSSALMTTLTQNASSSAADSRSGRKSKNNNKSSS 
QQSSSSSSSSSLSSGSSSSTWQEISQQTTWPESDSNSQVDWT 
YDPNBPRYCXCNQVSYGEMVGCDTQDCPIEWFHYGCVGLTEAPK 
GKWYCPQCT\AAMKRRGSRHK 


6110 


77 


2464 


ACPSAATMSDQDHSMDEMTAWKIEKGVGGNNGGKGNGSGAFSQ 
ARSSSTGSSSSTGGGGQESQPS PLAIJ.A ATCSR IBS PNENSHNS 
QGPSQSGGTGELDLTATQLSQGANGWQIISSSSGATPTSKEQSG 
SSTNGSNGSESSKNRTVSGGQYWAAAPNLQNQQVLTGLPGVMP 
NIQYQVIPQPQTVDGQQLQFAATGAQVQQDGSGQIQIIPGANQQ 
I ITtmCSGGNIIAAMPNIJ^QQAVPLQGLANNVI/SGQTQyVTWVP 
VAIiNGNITLLPVNSVSAATLTPS SQAVTI SSSGSQESGSQP VTS 
GTriSSASLVSSQASSSSFFTNANSYSTTTTTSNMGXMNFTTSG 
SSGTKSQGQrPQRVSGLCJGSDALNIQQWQTSGGSLOAGQQKEGE 
Q\NQQTQAAPKSI.SRPQLVQGG\QALQ\AFQAAPLSGQTFTTQA 
ISQETLQMLQLQAVPNSGPI I IRTPTVGPN6QVSWQTLQLQNLQ 
VQNPQAQTITIiAPMQGVSLGQTSSSNTTLTPIASAASIPAGTVT 
VNAAQI^SMPGLQTINLSALGTSGIQVHPIQGLPLAIANAPGDH 
GAQLGIiHGAGGDGIHDDTAGGEEGENSPDAQPQAGRRTRREACT 
CPYCKDS EGRGSGDPGKKKQHI CH IQGCGKVYGKTSHIjRAHLRW 
HTGERPFMCrwsyCGKRFTRSDELQRHKRTHTGEKKFACPECPK 
RPMRSDHIiSKHIKTHQNKKGGPGVALSVGTLPLDSGAGSEGSGT 
ATPSALITTNMVAMEAI CPEGIARLANSGINVKEGGQFCS PINT 
SANGF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
res idue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E=: 
Glutamic Acid, F= Phenylalanine^ G=Glycine, 
H=Histidine/ 1= Jsolevicine , K= Lysine , 
Xi-Leucine . M=Methionine . NsAsoaracrinp 
P=Prollne, Q=Glutamine, R=Arginine, 
S=Serlne, T=Threonine, V=Valine, 
W-Tryptophan, Y=:Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 


6111 


1637 


797 


rvdprvrqamapwgkrlagvrgvlldisgvlydsgagggtaIag 
sveavarlkrsrlkvrfctnesqksraelvgqlqrlgfdiseqe 
vtapapaacqilkergijrpyllihdgv\asefdqidts/stpnc 
wiadagesfsyqnmnnafqvlmelekpvlislgkgryyketsg 

LMLD VGP YMKALE YAOG I KABVGG KPS PE FFKS ALQ A IGVE AHQ 
AVMIGDDIVGDVGGAQRCGMRALQVRTGKFRPSDEHHPEVKADG 
YVDNLAEAVDLLLQHADK 


6112 


77 


136 


MSSHKSFKSKRFIAKKQKPNRPILQWIWLKTGNKIRHNWK 


6113 


1779 


567 


WEGRSWAACGVNLQGAWGERSGVRASEAESPGKRADVSWWSRQL 
ETMVDHLANTE INSQRI AAVESCFGASGQPLALPGRVLLGEGVL 
TKECRKKAKPRIFFLFISIDILVYGSIVLNKRKYRSQHIIPLEEVT 

LEIjLP etlqaknrwmi ktakksfwsaasaterqbw ishieecv 
rrqlratgrpa\stehaapwipdkatdicmrctqtrfsaltrrh 
hcrkcrvwcaecsrqrfllprlspkpvrvcslcyrelaaqqrk 
eeaeeqgagvpraashlarp i cgr pvemtmtptrtrraagtatg 

PAAWSSTPRGWPGIjPSTADPRPAEHIjS PSQLHCPGPQEGS SRSC 
PGLRDP I PWWQVQRWGVALSGLPVPFCWTLCPYGFTAGNAFPFR 
KPQNTHRSW 


6114 


818 


246 ' " 


f J. oJKlf Kir*^ ifJWlb Wt>*\\^VaAfWaa & Wf A£>fa & W Ft-(5PRRCCTR 
RRRCSPRCGLAAGSMCSCSPSWRCTPVPACWPSPPP\PAEQVQC 
GHLPPHADRRALRLPVAAPARGPGPGHPAGPAGPRPARTPPASP 
HGPGRPTVPAPPCPLIAATEPTPSRPHQRWTRBDRI4LGRGSQVT 
GRPQWFLRGLVLFSIi 


6115 


324 


71 


DVCGRVGAHPHLYTHIHMHICAHACXIHTHAQLC/ ITASHALAH 
SHLYTCMVMLTASHTPSHTHPHTAVHKEHRADVLRGTLTPIiR 


6116 


595 


1430 


TGVMPPGRWHAA/ 1 SSSGPVFEGARA\ LQTVKKEEEDESYTPVQ 
AARPQTLNRPGQEIiFRQIiFRQLRYHESSGPLBTLSRIiRELCRW^ 
LRPDVLSKAQILELLVLEQFLSILPGEIjRVWVQLHNPESGEEXL 
t.WKauKva rXjMGHPGGl RALjP \BPRCAI*UGYKS \IjKSAQ1 WSL 
ASPLRSSSALGDHLEPPYEIEARDFLAGQSDTPAAQMPALFPRE 
GCPGDQVTPTRSLTAQLQETMTFKDVEVTFSQDEWGWLDSAQRN 
IiYRDVMLENYRNMASLGK 


6117 


1433 


222 


vgvpspappcswbvgpgggwtpgxlkegqggrrtpllllatrtr 
glilslfppaamhpaafplpvwaavlwgaaptrgliratsdhma 
smdftidlpalfgatlsqeglqgpiiveahpdnacspiappppapv 

NGSVF I ALLRRFDCNFDLKVLKAQKAG YGAAVVHNVBTSNELIiNM 

vwnsebiqqqiwipsvfigersseylralfvyekgaevllvpdk 
tfplgyylipftgivgllvlamgavmiarciqhrkrlqrnrltk 
\eqlkqi\pthdyqkgdqydvcaicldeyedgdklrvlpcahay 
hsrcvdpwjutqtrktcpickqpvhrgpgdedqeeetqgqbegde 
geprdhpasertpllgssptlptsfgslapaplvfpgpstdppl 
sppsspvilv 


6118 


1044 


247 


stiscractsgatpgaqshrsarghaaggketaarigmergkvkk 
kekeketqkekigekgreekvkrkeveqkikqekqekqerrkgk 
ekeekrtkqgketnkekeqfkgqeekgenk0stltrtpleplek 
nkqilvlglik3agktsvlhslasnrvqhsvaptqgfhavcinte 
dsqmefleiggskpfrsywemylsn/adslarsfsvgfkqdsqp 
itwkakkylhqliaanpvlplwfankqdleaayhitdiheaia 

II 


6119 


1217 


462 


DPRFVTBNTTKAPAQBRTTQPRSSREGTLRSTMEYLSALNPSDL 
LRS VSN IS SEFGRRVWTSAPPPQRP PRVCDHKRTIRKGLTAATR 
QELIiAKALBTLLLNGVLTLVLEEDGTAVDSEDFFQLLEDDTCLN 
VLQSGQSWSPTRSGVLSYGLGRERPKHSKDIARFTFDVYKQNPR 
DLFGSLNVKATFYGLYSMSCDFQGL\GPKKVIiRELLRWTSTLLQ 
GLGHMLU3 ISSTLRHAVEGAEQWQQKGRIiHSY 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleot ide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C-Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H-Histidine, I=Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, W=Aeparagine, 
PsfProline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V-Valine, 
W=Tryptophan, Ys=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6120 


785 


179 


liERAGGGGIiSSRAIjVGSGACIiSLVARANGKGbPRGRKEFVEAVR 
VRYVAFRYRTPRAVCLRLWSCRREVIMSGRGKQGGKVRAKAKSR 
SSRAGLQFPVGRVHRLLRKGITYAERVGAGAPVYIAAVLEYLTAE 
I LELAGNAARDNKKTRI I PRHLQLAI RNOEELNKLLGKVTI AQG 
G\VLPNIQAVLLPKKTESQKDEGANDP 


6121 


1612 


107 


FVRAOARGSRQPVRRPIiLGAGSRLRCRSCGRMEPLKVEKFATAN 
RGNGLRAVTPLRPGELLFRSDPLAYTVCJCGSRGWCDRCLLGKE 
KLMRCSQCRVAKYCSAKCQKKAWPDHKRECKCIiKSCKPRYPPDS 
VRLLGRWFKLMDGAPSESEKLYSPYDLESWINKLTEDKKEGLR 
QLVMTFQHFMREEIQDASQLPPAFDLFEAFAKVI CNS FTICNAE 
t4QEVGVGL YPS ISLLNHSCDPNCS I VPNGPHLLLRAVRDIEVGE 
ELTICYLDMLMTSEERRKQLRDQYCFECD\CFRCQTQDKD7yDML 
TGDBQVWKEVQBSLKKIEEL KAHWKWEQVLAMCQAI IS SNSERL 
PD INI YQLKVtiDCAMDACINLGLIiEEAbFYGTRTME P YRI FFPG 
SHPVRGVQVMKVGKLQLHQGMFPQAMKNLRLAFDIMRVTHGREH 
SLI EDLILLLE /AMRRQHQS I LRERSQREIRRVSLLHALLRSHT 
LCFVSCVNLSYWKFCSVPV 


6122 


2 


2324 


RFRKMADGGAASQDESSAAAAAAADSRMNNPSETSKPSMESGDG 
NTGTQTNGLDFQKQPVPVGGAISTAQAQAFLGHLHQVQLAGTSL 
QAAAQSIjNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQLMIiA 
GGQITGLTLTPAQQQItLLQQAQAQAQIiliAAAVQQHSASQQHSAA 
GATISASAATPMTQIPLSQPIQIAQDIiQQLQQLQQQNLNLQQFV 
LVHPTTNLQPA\QFIISQTPQGQQGLLQANQNLLTQLPRQSQAN 
LLQSQPRI\TLTSQPATPTCTIAATPIQTLPQSQSTPKRIDTPS 
LEEP\SDLEELEQFAKTFKQRRIKLGFT\QGDAGLAMVKLYGND 
FSPTTIFRFBALNLSFKNMCKLKPLI*EKWLNDAENIiSSDSSLSS 
PSALNSPGIEGIiSRRRKKRTSIEA\NIRVALEKSFIiEN'\QKPTS 
EEITMIADQIiNMEKGVIRWFCNRRQKEKRINPPSSGG\TSSSP 
IKAIFPSPTSIiVATTPSLVTSSAATTLTVSPVLPLTSAAVTNLS 
VTGTSDTTSNNTATVISTAPPASSAVTSPSLSPSPSASASTSEA 
SSASETSTTQTTSTPIiSSPLGTSQVMVTASGIiQTA/AQLLPFKG 
AAQLPAWASIAAMAAAAGLNPSLM7VPSQPAAGGALLSLNPGTLS 
GALSPALMSNSTLATIQALASGGSLP I TSLDATGNLVFANAGGA 
PNIVTAPLFLNPQNLSLLTSNPVSLVSAAAASA6NSAPVASLHA 
TSTSAES IQNSLFTVASASGAASTTTTASKAQ 


6123 ■ ■ 


3 


2944 


HLLHRWFGTDMQMINFTTGE FQLTEACP YliGTHS EESRFGI LHL 
HLQPLEMKRVGWFTPAEYGKVTSLI LI RNNIiTVIDMI GVE6FG 
AREDLKVGGRLPGAGGSIiRFKVPESTLMDCRRQLKDSKQILSlT 
KNFKVENI GPLP IT VS SLKINGYNCQG YGFEVfiD CHQFS LDPNT 
SRDISIVFTPDPTSSWVIRDLSLVTAADLEFRFTUm'LPHHLL 
PLCADWPGPSMEESFWRLTVFFVSIiSLLGVILIAFOQAQYILM 
EPMKTRQRQNASSSSQQNNGPMDVISPHSYKSNCKNFLDTYGPS 
DKGRGKNCLPVNTPQSRIQNAAKRSPATYGHSQKKHKCSVYYSK 
HKTSTAAASSTSTTTEEKQTS PLGSSLPAAKEDI CTDAMRBNWI 
SI^YASGINVNLQKNLTLPKNLLNKEENTLKKTTIVFSNPSSECS 
MKEGIQTCMFPKETDIKTSENTAEPKBIiELCPLKTSiaa.PENHL 
PRNSPQYHQPDLPE I SRKNNGNNQQVPVKNBVDHCENLKKVDTK 
PSSEKKIHKTSREDMFSEKQDIPFVEQEDPYRKKKLQEKREGNL 
ONLNWSKSRTCRKNKKRGVAPVSRPPEQSDLKLVCSDFERSELS 
SDINVRSWCIQESTREVCKADAEI/VSSLPAAQREAEGYYQKPEK 
KCVDKFCSDSSSDCGSSSGSVRASRGSWGSWSSTSSSDGDKKPM 
VDAQHFIiPAGDS VSQNDFPSEAPISIxNLSHN I CNPMTGNSLPQ Y 
AEPSCPSLPAGPTGVEEDKGLYSPGDLWPTPPVCVTSSLNCTLE 

ngvpcviqesapvhnsfidwsatcegqfssaycplelndymafp 
ebhmktyakgfpcpadvqtdpidhwsqstwktppXnmpasXwgna 

QFPSSSRPYI.KSTPKACIjPMSGLFGPI\WAP\QSDVYENCCPIN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine^ G-Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=:Methionine , N=Asparagine , 
psproline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, v^Valine, 
WsTryptophan, y=Tyrosine, X^Unknown, *^stop 
Codon, /^possible nucleotide deletion, 
\»p033ible nucleotide insertion) 








PTTEHS D / THMENQA\ WCKEYYPGF \NPFRAy MNLD I WTTT \ A 
NRNANFPLSRDSSXCGNV 


6124 


1573 


236 


SDEALRLAGERGMGRVQLFEISLSHGRWYSPGEPLAGTVRVRIi 
GAPLPFRAXRVTCIGSCGVSKTKANDTAWWEEGYFNSSLSLADK 
GSLPAGEHSFPFQFLL.PATAPTSFEGPFGKIVHQVRAAIHTPRF 
S KDHKCSLVFY IIiSPLNLNS I PDIBQPNVASATKKFSYKLVKTG 
SWIiTASTDLRGYWGQALQLHADVENQSGKDTSPWASLLQKV 
S YKA ECRW I HDVRT lAE VEGAG VKAWRRAQWHEQ I LVPALPQSAL 
PGCSLIHIDYYLQVSLKAPEATVTLPVFIGNIAV/NPCPSEPPA 
RPGAASWGPTPGG\ PSAPPQEEAEAEAAAGGPHFLD P VFLSTKS 
HSQRQPLIiATIiSS VPGAPE P CPQDGS PASHP liHPPLCISTGATV 
PYFAEGSGGPVPTTSTLILPPEYSSWGYPYEAPPSYEQSCGGVE 
PSLTPES 


6125 


1 


904 


KTCP KIiTCAFTVS VPDS CCRVCRGDGEIiS WEHSDGDI FRQPAKR 
EARHS YHRSHYDPP PSRQAGGLSR FPGARSHRGALMDSQQASGT 
I VQ I VINNKHKHGQVCVSNGKT YSHGES WHPNIiRAFG I VECVLC 
TCNVTKQECKKIHCPNRYPCKYPQKIDGKCCKVCPG/KKAKEEL 
PGQSFDNKGYFCGEETMPVYESVFMEDGETTRKIALETERPPQV 
EVHVWT IRKGILQHFHI EKIS KRMFEELPHFKIjVTRTTLSQWKI 
FTEGEAQ I S QMCSSRVCRTEIjEDIiVKVLYIjERSEKGHC 


6125 


1224 


389 


RLLSEAP CPRSRRRFQMNPEWGOAFVHVAVAGGLCAVAVPTG I F 
DSVSVQVGYEHYAEAPVAGLPAFIiAMPFNSIiVNMAYTLLGLSWL 
HRGGAMGIiGPRYLKDVFAAMALLYGPVQWLRLWTQWRRAAVLDQ 
WLTLPIFAWPVAWCLYLDRGWRPVWLFLSLECVSLASYGLALLH 
PQGFEVALGAHVVPAVGQALRT\HRHYG/SATPSATYIiALQVLS 
CLGFWLKLCDHQtARWRLFOCLTGHPWSKVCDVIiQFHFAFLFri 
THFWTHPRFHPSGGKTR 


6127 


1335 


463 


VLPRRCLVFWNTMDSSREPTLGRLDAAGFWQVWQRFDADEKGY 
lEBKELDAFFLHMLMKLGTDDTVMKANIiHKVKQQFMTTODASKD 
GRIRMKELACMFLSEDENFLLLFRRENPLDSSVEPMQIWRKYDA 
DSSGFISAAELRNFLRDItFriHHKKArSEAKLEEYTG™MKIFDR 
NKDGRLDLNDLARILALQENFLLQFKMDACSTEKRKGDFEKIPA 
YYDVSKTGALEGP\EVDGFVKDMMELVQPSISGVDLDKFRBILL 
RHCDVNKDGKI QKS ELALCr,GLKINP 


6X2 B 


2511 


943 


TCRMSRRQLERWVWSSQQVQARGRIJVRAPRLGKIAMGLEMSSKD 

SPGSLDGRAWEDAQKPQSAWCGGRKTRVYATSSRRAPPSEGTRR 

GGAARPEKTAEBGPPAAPGSLRHSGPLGPHACPTALPEPQVTSA 

MSSQWG I EPLY 1KAEPASPDSPKGSSETETEPPVALAPG\ PAP 

TRCLPGHKEEEDGEGAGPGEQGGGKLVLSSLPKRIiCLVCGBVAS 

GYHYGVASCEACKAFFKRTIQGSIEYSCPASNECEITKRRRKAC 

QACRFTKCLRVGMLKEGVRLDRVRGGRQKYKRRPEVDPLPFPGP 

FPAGPLAVAGGPRKTAAPVNALVSHLLWEPEKLYAMPDPAGPD 

wHLiP AVA 1 i^CDLFDRE I VVT I SWAKS I PGFSS LSL S DQM 

VWMEVLVIjGVAQRSIiTLQDEL7^AEYLVIjDEEGARPAGLGELG\ 

AALLQLVRRLQAIiRLEREHYVbLKAIiALANSDSVHIEDEPRLWS 

S CEKLLHEALLE YEAGRAGPGGGAERRRAGRLLLTLPLLRQTAG 

KVIAHFYGVKLEGKVPMHKL FLSMLEAMMD 


6129 


1764 


771 


ARFARSAHEGKMPKKKTGARKKAENRREREKQLRASRSTIDIiAK 
HPCNASMECDKCQRRQKNRAFCYFCNSVQKLPICAQCGKTKCMM 
KSSDCVIKHTVGVYSTGIiAMVGAlCDFCEAWVCHGRKCLSTHACA 
CPLTDAEC\VECERGVWDHGGRIFSCSFCHNFLCEDDQFEHQAS 
CQVIjEAETFKCVSCNRLGQHS CLRCKACFCDDHTRSKVFKQBKG 
KQPPCPKCGHETQETKDLSMSTRSLKFGRQTGGEEGDGASGYDA 
YWKNLSSDKYODTSYHDEEEDEYEAEDDEEEEDEGRKDSDTESS 
DLFTNIiNLGRrYASGYAHYEEQEN 


6130 


3 


577 


GRGGTMRE YKWVLGSG \GVGKSALTV\QFVTCTFIEK YDPTI E 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=:Cysteine, D=A3partic Acid, E== 
Glutamic Acid, F=Phenylalanine, G«Glycine, 
H»Histidine, I=Isoleucine» K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q^Glutamine, R=Arginine, 
S=Serine, T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








DFVRKEIEV\DSSPSVAGISWTQQGTEQF\ASMRDLYIKKGQGC 
I liVYSLVNQQSFQ \DI KPMRDQI IRVKVSEKVPVI \ LVGN\S VD 
LESEREVSSSEGRALAEEWGCPFP4ETSAK$KTM VDEL FAE I VRQ 
MWYAAQPDKDDPCCSACNIQ 


6131 


3 


1811 


SSPREKTSDSSHRPSRHGFLFLRLVGLSPFSYLCVPPSRPVPGS 
PRSLSAMRLLPLAPGRLRRGS PRHLPSCS PALLLLVLGGCLGVF 
GVAAGTRRPNWliXiLTDDQDEVIiGGMTPLKKTKALIGEMGiyiTFS 
SAYVPSALCCPSRAS ILTGKYPHNHHWWNTLEGNCSS KSWQKI 
QE PNTFPAI LRSMCG YQTFFXAGKYLNE YGAPDAGGLEHVPLGW 
SYWYALEKNSKYYNYTLSIKGKARKHGENYSVDYLTDVIANVSL 
DFLDYKSNFEPFFMMTATP\APHSPWTAAPQYQKAFQNVFAPRN 
KNFNIHGTNKHWLTRQAKTPMTNSSIQFLDNAFRKRWQTLLSVD 
DLVEKLVKRLEFTGELNNTYIFYTSDNGYHTGQPSIiPIDKRQLY 
EFD I KVPLLVRGPGIKPNQTS KMLVANIDLGPT r LD I AGYDLNK 
TQMDGMSLLPIUiGASNLTWRSDVIiVEYQGEGRNVTDPTCPSLS 
PGVSQCFPDCVCEDAYKNTYACVRTMSALWNLQYCEFDDQEVFV 
BVYWLTADPDQITNIAKTIDPELIiGKMNYRLMMLQSCSGPTCKT 
PGVFDPGYRFDPRLMFSNRGSVRTRRPSKHLL 


6132 


56 


1241 


AAGLLPPGLVPSDPRRTRNLLPFGIQGPPFALSRPLiFSCVSSGW 
AWEAMEPEFLYDLLQLPKGVEPPAEEELSKGGKKKYLPPTSRKD 
PKFEELQKPA\VLMEWINATLLPEHIVVRSLEEDMFDGIiILHHL 
FQRX<AALKLEAEDIALTATSQKHK1*TWLEAVNRS\CSWRSGRP 
SGA/WES I FNKDLLSTLHLLVALAKRFQPDLSLPTNVQVEVITI 
ESTKSGLKSEKXiVEQLTEYSTDKDEPPKDVFDELFKIAPEKVNA 
VKSAIVNFVNQKLDRr,GLSVQNLDTQFADGVILLLLIGQI*EGFP 
LHLKBFYIiTPNS PAEMLHNVTLALE1,L/ IGRGPAQLPC /WdjK/ 
TIVNKDAKSTLRVLYGLFCKHTQKAHRDRTPHGAPW 


6133 


2 


f 4256 — ■ 


FVHGSPlADTDriFMECEEEEIiEPWQKI SDVIEDS WEDYNS VDKT 
TTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGAQNSDSTK 
KTLVTLlANNNAGNPIiVQQGGQPLILTQNPAPGLGTMVTQPVLR 
PVQVMQNAWHVTSSPVASQPIPITTQGFPVRNVRPVQiTAMNQVG 
rVLNVQQGQTVRPITLVPAPGTQPVKPTVGVPQVFSQMTPVRPG 
STMPVRPTTNTPTTVIPATLTIRSTVPQSQSQQTKSTPSTSTTP 
TATQPTSLGQLAVQSPGQSNQTTNPKLAPSFPSPPAVS IAS FVO: 
VKRPGVTGENSNEVAKLVNTLNTIPSLGQSPGPVWSNNSSAH\ 
GSQRTSGPESSMKVTSSIPVFDLQDGGRKICPRCNAQFRVTEAL 
RGHMCYCCPEMVEYQKKGKSLDSEPSVPSAAKPPSPEKTAPVAS 
/THPSSTPIPALSPPY/TKVPEPNENVGDAVQTKLIMLVDDFYY 
GRDGGKVAQLTNFPKVATSFRCPHCTKRLKNNIRFMNHMKHHVE 
LDQQNGEVDGHT ICQHCYRQFST PPQLQCHLENVHS PYESTTKC 
KI CEWAPESE PIjFI.QHMKDTHKPGEMPYVCQVCQYRS SLYSEVD 

vhfrmihedtrhllcpyclkvpkktgnapqqhymrhqkrKnvyhX 
cnkcrvqplfakdkibhklqhhktfrkpkqlsqlkpgtkvtira 
srgqprtvpvssndtppsalqeaapltssmdplpvflyppvqrs 
iqkravrkmsvmgrqtclecsfeipdfpotfptyvhcslcryst 

CCSRAYAKHMINNHVPRKSPKYLAIiFKNS VSG I KIiACTSCTFVT 
S VGDAMAKHLVPNPS HRS SS IL PRGLTWI AHSRHGQTRDRVHDR 
NVKNMYPPPSFPTNKAATVKSAGATPAEPEELLTPLAPALPSPA 
STATPPPTPTHPQALALPPIiATEGAECIiNVPDQDEGSPVTQEPE 

lasggggsggvgkkeqlsvkklrwlfalccnteqaaehfrnpq 
rrirrwlrrfqasqgenlegkylsfeaeeklaewvltqreqqlp 
vneetlfqkatkigrsleggfkisyewavrfmlrhhltpharra 

VAHTIjPKDVAENAGLFIDFVQRQ IHNQDLPLSMI VAIDE I SLFL 
DTEVLSSDDRKENALQTVGTGE PWCDWLAILADGTVLPTLVFY 
RGQMDQPANMPDSILtiEAKESGYSDDEIMELWSTRVWQKHTACQ 
RSKGMLVMDCHRTHLSEEVLAMLSASSTLPAWPAGCSSKIQPL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepti'H^ 

C= Cysteine. Ds;a«ina-rHf* tr^i/^ t> 
Glutamic Acid, F^Phenyl alanine, G«Glycine, 
H=Histidine, It»Isoleucine, K^Lyslne, 
Ij«Leucine, ^-Methionine, W=Asparagine , 
P= Proline, Q=Glut amine, R=;Arginine, 
S=Serine, T=Threonine, V= Valine, 
W^Tryptophan, Y=Tyrosine, X«Unknown, +=.Stop 
CodoH; /=pos Bible nucleotide deletion, 
\-possible nucleotide insertion) 


5134 






DVCIKRTVKj^fLHKKWKEQAREMADTACDSDVLLQIiVLVWLGEV 

LGVIGDCPELVQRSFLVASVLPGfDGIiriNSPTRNADMQEELIAS 

liEEQLKLSGEHSESSTPRPRSSPEETIEPBSLHQLFEGESETES 
FYGFEEADLDLMEI 


€135 




425^ 


FVHGSMADTDLFMECEEESLEPWQKISDVIEDSWEDYNSVDKT 

TTVSVSQQPVSAPVPIAAKEASVAGHLSTSTTVSSSGAQNSDSTK 

KTLVTLIANNNAGKPLVQQGGQPLILTQNPAPGLGTMVTQPVLR 

PVQVMQNANHVTSSPVASQPIFITTQGFPVRNVRPVQNAMNQVG 

IVLNVQQGQTVRPITLVPAPGTQFVKPTVGVPQVFSQMTPVRPG 

STMPVRPTTNTFTTVIPATLTIRSTVPQSQSQQTKSTPSTSTTP 

TATQPTSLGQliAVQSPGQSNQTTNPKIAPSFPSPPAVSIASFVT 

VKRPGVTGENSNEVAKLVNTLNTIPSLGQSPGPVWSNNSSAH\ 

GSQRTSGPESSMKVTSSIPVFDLQDGGRKICPRCNAQPRVTEAL 

RGHMCVCCPEKtVEYQKKQKSLDSEPSVPSAAKPPSPEKTAPVAS 

/THPSSTPIPALSPPy/TKVPEPNENVGDAVQTKLlMLVDDFYY 

GRTOGKVAQLTNFPKVATSFRCPHCTKRLKNNIRFMNHMJCHHVB 

I-DQQKGEVDGHTICQHCYRQFSTPFQLQCHLENVHSPYESTTKC 

KICEWAFESEPLFLQHMKDTHKPGEMPYVCQVCQYRSSLYSEVD 

VHFRMIHEDTRHLLCPyci,KVFKKrGNAFQQHYMRHQKR\jmrH\ 

CNKCRVQFLFAKDKIEKKLQHHKTFRKPKQLEGLKPGTKVTIRA 

SRGQPRTVPVSSNDTPPSALQEAAPLTSSMDPLPVPLYPPVQRS 

IQKRAVRKMSVMGRQTCLECSFSIPDFPNHFPTYVHCSLCRYST 

CCSRAYANHMINNHVPRKSPBOTiALFKNSVSGIKLACTSCTFVT 

S VGDAMAKHLVFNPSHRSSS rLPRGLTV7IAHSRHGQTRDRVHDR 

iNvxuMwiffftairFXN KAATVKS AGATPAE PEELLTPIiAPAL PS PA 

STATPPPTPTHPQALALPPLATEGAECLNVDDQDEGSPVTQEPE 

IiASGGGGSGGVGKKEQLSVKKUlWLFALCCKTEQAAEHFHNPQ 

RRIRRWLRRPQASQGENLEGKYLS FEAEEKLAEWVLTQREQQLP 

VNEETLFQKATKIGRSLEGGPKISYEKAVRFMLRHHLTPHARRA 

VAHTLPKDVAENAGLPIDFVQRQIHNQDLPLSMIVAIDEISLFL 

DTEVLSSDDRKBNALQTVGTGEPWCDVVLAirADGTVLPTLVFY 

RGQMDQPANMPDSII^IiEAKESGYSDDEIMELWSTRVWQKHTACQ 

RSKGMLVMDCHRTHI,SEEVLAMLSASSTLPAWPAGCSSKIOPi 

DVCIKRTVKNFLHKKWKEQAREMADTACDSDVLLQLVLVWLGBV 

LGVIGBCPELVQRSFLVASVLPGPDGWIWSPTRNADMQEBLIAS 

LEEQLKLSGEHSESSTPRPRSSPEETIEPBSLHQLPEGESETES 
FYGFEEADLDU^E I 




2 


4256 

( 


FVHGSMADTDUFMECEEEELEPWQKISDVIEDSWEDYNSVDKT 
TTVSVSQQPVSAPVPIAAHASVAGHLSTSTTVSSSGAQNSDSTK 
KTLVTLIANNNAGWPLVQQGGQPLILTQNPAPGLGTMVTQPVLR 
PVQVMQNAWHVTSSPVASQPIFITTQGFPVRI^VRPVQNA^4NQVG 
IVLNVQQGQTVRPITLVPAPGTQPVKPTVGVPQVFSQMTPVRPG 
STMPVRPTTNrFTTVIPATLTIRSTVPQSQSQQTKSTPSTSTTP 
TATQPTSLGQLAVQSPGQSNQTTNPKLAPSFPSPPAVSIASPVT 

vkrpgvtgensnevaklvntlntipslgqspgpvwsnhssahN 
gsqrtsgpessmkvtss i pvfdujdggrkicprcnaqfrvteal 
rghmcyccpbmveyqkkgksldsepsvpsaakppspektapvas 
/thpsstpipalsppy/tkvpepwenvgdavqtklimlvddfyy 

GRDGGKVAQLTNFPKVATSFRCPJICTKRLKNNIRFMNHMKHHVE 

ldqqngbvdghticqhcyrqfstpfqlqchlenvhspyesttkc 
kicewapeseplflqhmkdthkpgempyvcqvcqyrsslysevd 

VHFRMIHEDTRHt,LCPYCLKVFKHGNAFCJQHYMRHQKR\NVYH\ 
CtTKCRVQFLFAKDKIEHKLQHHKTFRKPKQLEGIiKPGTKVTIRA 
SRGQPRTVPVSSNDTPPSAIiQEAAPLTSSMDPLPVFLYPPVQRS 
rQKRAVRKMSVMGRQTCI.ECSFEXPDFPNHFPTYVHCSLCRYST 
XSRAYAiraMINNHVPRKSPKYLALPKNSVSGIKLACTSCTPVT 
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SHQ 
ID 
NO: 


1 Predicted^ 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
cor resp ondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Am,ino acxd segment containing signal peptide ' 
(A:^lanine, C«Cysteine, D=Aspartic Acid, 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K« Lysine, 
L=Leucine, M=Methionlne, N=AspEiragine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, TsThreonine, V^Valine, 
W=tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SVGDAMAKHLVFNPSHRSSSILPRGLTWIAHSEiHGQTRDRVmfR"^ 

NVKNMYPPPSPPTNKAATVKSAGATPAEPEELLTPIxAPALPSPA 

STATPPPTPTHPQAIiALPPLATEGAECLNVDDQDEGSPVT^EPE 

LASGCSGGSGGVGKKEQLSVKKLRWLFALCCNTEQAAEHFRNPQ 

RR rRRWLRRFQASQGENLEG KVI^S FEAEEKLAEWVLTQREQQI.P 

VWEETLFQKATKlGRSLEGGFKISyEWAVRFMLRHtUiTPHARRA 

VAHTLPKDVAENAGLFIDFVQRQIHNQDIiPLSMIVAlDEISIiFL 

DTEVLSSDDRKENALQTVGTGEPWCDWIAlIiADGTVLPTLVFy 

RGQMDQPANMPDSIIiLEAKESGYSDDEIMELWSTRVWQKHTACQ 

RSKGMLVMDCHRTHLSEEVLAMLSASSTLPAWPAGCSSKIQPL 

DVCIKRTVKNFLHKKWKEQAREMADTACDSDVLLQLVLVWLGEV 

LGVIGDCPELVQRSFLVASVLPGPDGNINSPTRSADMQEELIAS 

LEEQLKLSGEHSESSTPRPRSSPEETIEPESLHQLFEGESETES 

FYGFEEADLDLMEI 


6136 


1704 


539 


j FGVRMALEGMSKRKRKRSVQEGENPDDGVRGSPPEDYRLGQVAS 
SLPRGEHHSRGGTGRLASIiFSSLEPQIQPVyvPVPK\ESALASA 
DLBBEIHQKQGQKRKNSQPGVKVADRKILDDTEDTWSQRKKIQ 
INQEEERLKNERTVFVGNLPVTCNKKKLKSFFKEYGQIESVRFR 
SLIPAEGTLSKKLAAIKRKtHPDQKNINAYWFKBESAATQALK 
RNGAQIADGFRIRVDLASETSSRDKRSVFVGNLPYKVEESAIEK 
HFLDCGSIMAVRIVRDKMTGIGKGFGYVLFBNTDSVHliALKLNN 
SELMGRBCLRVMRSVNKEKFKQQNSNPRLKNVSKPKQGLNFTSKT 
AEGHPKSLFIGEKAVLLKTKKKGQKKSORPKKQRKQK 


6137 


141 


2656 


RALRKRRCGPGRRGALGSGPGPQRRPGRVPEERPAPPRERICHPG 

MWNMLIVAMCLA\IjLGLPGKAQELQGHVS\lIIiAGEQLGDliAKK 

YLWQG\LFQLYLDEAGRGHS PSFHGAALTAPKQGQElWAKALES 

LSCPKDMAPSHCAEHKDQFLQLSQYRQLKTAEDYQALNKDIEAQ 

LQHAGLREAGGIFYFSVPPPAYEDIARNINSSCRPGPGAWLRW 

LEKPPGHDHPSAQQLATELGTFFQEEEMYRVDHYLGKQAVAQIIi 

PFRDQNRKALDGLWNRHHVERVEIIMKETVDAEGRTSFYEEYGV 

IRDVLQNHLTEVLTLVAMEIiPKNVSSAEAVLRHKLQVFQALRGL 

QRGSAWGQYQSYSEQVRRELQKPDSFHSLTPTFAGVLVHIDNL 

RWEGVPFILKSGKALDERVGYARILFKNQACCVQSEKHWAAAQS 

QOjPRQLVPHIGHGDLGSPAVLVSRNLFRPSLPSSWKEMEGPPG 

LRLPGSPLSDYYAYSPVRERDAHSVLLSHIFHGRKJTFPITTENL 

LASWNFWTPLLESLAHKAPRLYPGGAENGRLLDFEPSSGRLFFS 

QQQPEQLVPGPGPGPMPSDFQVLRAKYRESSIiVSAWSEELlSKL 

ANDIBATAVRAVRRPGQPHLAIiSGQSSPVALPQQIATAHYGFPW 

AHTHLWLVDERCVPL3DPESNFQGLQAHLLQHVRIPYYNlH\At/I 

PVHLQQRIiCAEEDQGAHIYAREISALGANSSFDLVLLGMGADGH 

TASLFPQSPTGLDGEQLWLTTSPSQPHRRMSLSLPLrNRAKKV 

AVLVMGRMKREI TTLVSRVGHEPKKWP ISGVLPHSGQLVW YMDY 

DAFLG 


6138 


4587 


934 


EPS KLTDRWQNAVQGVRORKGDVDGLVROWODPTTc? VPMT i>t> t?t — 

TDTSHLLSAVKGQERFSLYQTRSLIHELKNKEIHFQRRRTTCAL 

TLEAGEIOjLLTTDLKTKESVGRRISQLQDSWKDMBPQLAEMIKQ 

FQSTVETMDQCEKKIKELKSRLQVLKAQSEDPLPELHEDLHNEK 

EIiIKELEQSLASWTQNLECELQTMKADLTRHVLVEDVMVLKBQIB 

HLHRQWEDLCLRVAI R KQEIEDRLNTWWFBrEKNKELCAWLVQM 

ENKVLQTADISIEEMIEKXQKDCMEEINLPSBNKLQLKQMGDQL 

IKASNKSRAABIDDKLNKINDRWQHLFDVIGSRVKKLKETPAFI 

qqldknmsnlrtwrarieselskpwydvcddqeiqkrlaeqqd 
lqrdieqhsagvesvfnicdvllhdsdacanetecdsiqqttrs 

ta)RRWRNICAMSMERRMKIEETWRLWQKFLDDYSRFEDWI.KSAB 
RTAACPNSSEVLYTSAKEELKRFEAFQRQIHERLTQLELINKQY 
RRLARBNRTDTASRLKQMVHEGNQRWDNLQRRVTAVLRRIjRHFT 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A= Alanine, C=Cysteine, D=A3partic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycirie, 
H==Hastidine, I=Isoleucine, K=Lysine, 
L=Leucine, M^Methionine, N=Asparagine , 
P=Prol ine , Q=Glutamine , R=Arginine , 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *«Stop 
Codon, /=pos Bible nucleotide deletion, 
\=spoeEible nucleotide insertion) 








NQREEPEGTRES I liVWUTEMDLQLTNVEHFSESDADDKMRQLNG 

FQQE2TI»NTNKlDQIjIVFGEQLIQKSEP\LDAVLIEDELEEriHR 

YCQEVr GRVSRFHRRLiTSCTI Jrt:»ljEDi£KJELAolLKiii JJIXlliUPKoiy 

DS WRKRGESEEPSS PQSLCHLVAPGHERSGCETPVS VDS \ I PLE 

WDHTGRRGGPSSSH\EEDEEAQYY\SALSGKSISDGHSWHVPDS 

PSCPEHHYKQMEGDRWVPPVPPASSTPYKPPYGKLLLPPGTDGG 

KEGPRVLNGNPQQEDGGIiAGITEQQSGAFDRWEMIQAQELXHITK 

I»K I KQNLQQLNSD I S AITTWLKKTEAELEMLKMAKPPSDIQE IE 

IiR VKRLQE ILKAFDTYKAL WSVWVSSKE FLQTESPESTELQS R 

LRQLSLLWEAAQGAVDSWRGGLRQSLMQCQDFHQLSQNLLLWIA 

SAKNRRQKAHVTDPKADPRALLECRRELMQLEKELVERQPQVDM 

LQEISNSLLIKGHGEDCIEAEEKVHVI\EKKLKQLREQVSQDLM 

ALQGIQNPASPLPSFDSVDSGDQPPATSVPAPRAKQPRAVRTTE 

GEEETESRVPGSTRPQRSFLSRVVRAALPIiQLI*LLLLIiLIiACLIi 

PSSEEDYSCTQANNF\ARSFYPMLRYTNGPPPT 


S139 


52 


1131 


LGDWVWSRTCGVLETPTSVLRRARARGPCPTDSKWALPRLREGE 
TERR PWEASS WKTL/liAGW IGGAAS VI VGHP LDTVKTRLQAaVG 
YGNTLSCIRWyRRESMFGFFKGMSFPIiASIAVYNSWFGVFSK 
TQRFLSQHRCGEPEASPPRTLSDLLLASMVAGWSVGLGGPVDL 
I KI RLQMQTPP VSGRQPRFEVQGSGSCG \EPAYQGPVHCITTI V 
RNEGLAGLYRGASAMLLRDVPGYCLYFIPYVFliSEWlTPEACTG 
PSPCAVWLAGGMAGAI S WGTATPMDWKSRLQADGVYLNKYKGV 
LDCISQSYQKEGLKVFFRGITVNAVRGFPMSAAMFLGYELSIiQA 
IRGDHAVTSP 


6140 


694 


136 


RPELELWRLRSRSWRPLGVPRRCHRRNWKEPVRAQPLSVTVWAP 
RCQRP/QPPAPEPSSPNAAVPEAIPTPRATiASAALELPLGPAPV 
SVAPQAEAEARSTPGPAGSRLGPETPRQRFRQPRYQDAAGPREA 
FRQIiRBL/ SPRQWLRPD I \RTKEQ\ I VEMLVQEQLLAI LPEAAR 
ARRIRRRTDVRITG 


6141 


2 


984 


AQVGPRSRPCKMPLKLRGKKKAKSKETAGLVEGEPTGAGGGSLiS 
ASRAPARRLVFHAQrAHGSATGRVEGFSSIQEIiYAQIAGAFEIS 
PSE I LYCTLNTPKIDMERIiLGGQLGIjEDFI FAHVKGIEKE VNVY 
KSEDSLGLTITDMGVGYAFIKRIKDGGVIDSVKTICVGDHIBSI 
NGENIVGWRHYDVAKKLKELKKEELFTMKLIBPKKAFE I ELRSK 
AGKSSGEKIGCGRATLRLRSKGPATVEEMPSETKAKXAIEKIDD 
VLELYMGIRDIDLATTMFEAGKDKVNPDEFAVALDETLGDFAFP 
DEPVFDVWGVIGDAKRRGI/ 


6142 


116 


602 


BAEGEQVCGAKCCGDAPHVENRBEETARIGPGVMESKEERALNN 
XiIVEWVNQENDEKDEKEQVANKGEPLALPLfclVSEYCVPRGNRRR 
PRVRQP I LQ YRWDIMHRLGBPQARMREENMERIGEE VRQIiMEKL 
REKQLSHStRAVSTDPPHHDHHDEFC\LMP 


6143 


2802 


270 


FRMRIFjbHCPWNQQMWKIWNMiBTSLESCKAHIiSIQKIiLKERXQ 
\QLPVFiCHRDSIVETLKRHRVWVAGET\GSGKSTQVPHFIiLED 
T .TjT ,N RWE As KTN T VCTO PRR I SAVSliAMRVCDELGCENGPGGRN 
SLCG YQ r RMESRACESTRLLYCl'TGVLLRKLQEDGLLSNVS /HM 
FIVDEV\HER\SVQSDFIiLIILKEILQKRSDLHLILMSATVDSE 

kfstyfthcpilrisgrsypvevfhlediieetgfvlekdseyc 
qkfleeesevtinvtskaggikkyqeyipvqtgahadlnpfyqk 
yssrtqhallymnphkiniidlileiilayldkspqprniegavli 
plpglahiqqlydllsndrrfyserykvialhsilstqdq7vaaf 
TLPP PGVRKIVLATNIAETG I ti pdwfvidtgrtkenkyhes S 

QMSSLVETPVSKASALQRQGRAGRVRDGFCFRMYTRERFEGFMD 
YSVPEIIiRVPLEELCLHIMKCNLGSPEDPIiSKALDPPQLQVISN 
AMNLLRKIGACELNEPKLTPLGQHLAALPVNVKIGKML I FGAI F 
GCLDPVATIAAVMTEKSPFTTPIGRKDEADLAKSALAMADSDHL 
TIYNAYLGWKKARQEGGYRSEITYCRRNFLNRTSLLTUEDVKQE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(Arjaanine, C=Cysteine, D«Aspartic Acid, E= 

H=:Histidine, I^Isoleucine, K^Lysine, 
L=Leucine, M^Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S==Serine, T=Threonine, V= Valine, 
W=Tryptophan, y«Tyrosine, X=Unknown, *=Stop 
\^ija.%jiif / =possijoxe nucjieocxae cieiecion, 
V^possible nucleotide insertion) 








LIKLVKAAGFSSSTTSTSWEGNRASQTIiSFQEIALLKAVLVAGL 
YDNVGKI rYTKSVDVTEKIiACIVETAQGKAQVHPSSVNRDLQTH 
GWLLYQEKXRYARVYLRETTLITPFPVIiLFGGDIEVQHRERLLS 
IDGWIYPQAPVKIAVIFKQLRVLIDSVLRKKLBNPKMSLENDKI 
LQIITELIKTENN 


6144 


1289 
» 


568 


SGPGSMSGQRVDVKWMLGKEYVGKTSLVERYVHDRFLVGPyQN 
VS ASGGARHGGRGSGGP VTCT YGPDIiFPL VA V TIGAAFVAKVMS 
VGDRTVTLGIWDTAGSERYEAMSRIYYRGAKAAIVCYDLTDSSS 
FERAKFWVKELRSLEEGCQIYLCGTKSDLLEEDRRRRRVDFHDV 
UU I Aurj J. aAUXj FJS TS S KTGQ S VDELiFQKvAED YVSVAAFQVMTE 
DKGVDLGQKPNPYFYS CCHH 


6145 


X109 


196 


GGMDLSELERDNTGRCRLSSPVPAVCRKEPCVLGVDEAGRGPVL 
G PMVYAICYCPLPRLADIiEALKVADSKTLLES ERERLFAKMEDT 
DFVGWALDVLSPNLISTSMLGRVKYWIjNSLSHDTATGIiIQyALD 
QGVNVTQVFVDTVGMPETYQARLQQSFPGIEVTVKAKADALYPV 

WsaasicakvardqavkkwqfveklqdldtdygVsgypndpqd 

/TKAWLKEHVEPVF\GFP\QFVRF\SWRTAQT1\LEKEAEDVIR 

edsasenqeglrkitsyflnegsqarprsshryflerglestts 

L 


6146 


428 


781 


LKKKGKEKAEAQQVEALPGPSIiDQWHRSAGEEEDGPVIjTDEQKS 
R/ YPGHEAHDQGG\WDARQSI IRKWDPETGRTRLI KGDGEVIiE 
EIVTKERHREINKQATRGDCIiAFQMRAGIiIiP 


6147 


1 


2304 


GTRQLPPPSPGSGPGDSPE6PEGEAPERRRKAHGMLKLYYGLSE 

geaagrpagpdpldptdlngahfdpevyldklrrecplaqiimds 
etdmvrqiraldsdmqtlvyenywkfisatdtirkmkndfrkme 

DEMDRLATNMAVITDPSARrSATLQDRHERITKLAGVHAIiLRKL 
QFLFELPSRLTKCVELGAYGQAVRYQGRAQAVIiQQYQPJLPSFRA 
IQDDCQVITARLAQQLRQRFREGGSGAPEQAECVELLIiAIiGEPA 
EELCEEFLAHARGRLEKELRNLBAEIiGPSPPAPDVLEFTDHG\S 
SGFVGGLCX3VAAAYQEI*FAAQGPAGAEKIAAFARQLGSRYFALV 
ERRLAQEQGGGDNSIiVIU\LDRFEmRIJiAPGALLA7W3LADAAT 
EIVERVARERLGHHLQGLRAAFLGCLTDVRQAIiAAPRVAGKEGP 

glaeliianvass i lshi kaslaavhlptakevs fsnkpyprgsp 
csqgvreglivgpvhsmcqtaqsfcdspgekggatppallllls 

RLCLDYETATISYI LTLTDEQPLVQDQFPVTPVS TLCAEARETA 

rritlthyvkvqglvisqmlrksvetrdwlstleprnvravmkrv 
vedttaidvqvlprlagvaltqaggtvpsrgagaaedhwqslpg 
ggdmci washqas s varasvrepqgnksprmntkrageclcprs 
csfsaqdydifapilpvekqrlrvtqevraglylvlklrpqtws 
cilplphstgsinsdhvptk 


6148 


3bS6 


353 


VPAVGGTFADGAMGEAEKFHYIYSCDLDINVQLKIGSLEGKREQ 
KSYKAVLEDPMLKFSGLYQBTCSDLYVTCQVFMGKPIiALPVRT 
SYKAFSTRWNWNEWLKLPVKYPDLPRNAQVATiTIWDVYGPGKAV 
PVGGTTVSLFGKYGMFRQGMHDLKVWPNCRSQMDQKPTKTPGRT 
SSTIiSEDQMSRLAKLTKAHRQGHMVKVDWLDRLTFREIEMINES 
VKRSSNFM YLMGGFRCVKCDDKEYGI VY YEKDGDBS SP ILTS FB 
LVKVPDPQMSLENLVESKHHNLPRSLRSGPSDHDLKPYPSPRDQ 
LKNlVSYPPSKPPTYEEQDLVWEPRYYLTNQDKALTKILrSVIW 
DIjPQGAKQALALLGKWKPMDVEDSLELLSSHYTNPTVRRYAVAR 
LRQADDEDLLMYLLQLVQALKYENFDDIKKfGLEPTKKDSQSSVS 
ENVSNSGINSAEIDSSQ I IT/SAP FPSVSSPPP\ASKTKEVPDG 
ENLEQDLCTFLISRASKNSTLANYLYWYVIVECEDQDTQQRDPK 
THEMYLNVMRRFSQALLKGDKSVRVMRSLLAAQQTFVDRLVHLM 
KAVQRESGWRKKKNERLQALLGDNEKMNLSDVELIPLPLEPQVK 
IRGIIPETATLPKSALMPAQLFFKTBDGGKYPVIFKHGDDLRQD 
QLILQIISLMDKLLRKENIiDLKLTPyKVIATSTKHGFMQPIQSV 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepti"^^ 
(A«Alanine, C-Cyateine, D=Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L-Leucine, M=Methionine, N:=Asparagine, 
P=Proline, Q^Glutamine, R^Arginine, 
S*Serine, T=Threonine, V^Valine, 
W=Tryptophan, y==Tyrosine, XaUnknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6149 






PVAEVLDTBGSIQNFFRKYAPSENG&NGISAEVMUTYVKSCAGY 
CVrTYILGVGDRHLDNLLLTKTGKLFHIDFGYILGRDPKPLPPP 
MKLNKEMVEGMGGTQSEQYQEFRKQCYTAPLHLRRYSNLILNLF 
SLMVDAWIPDIALEPDKTVKKVQDKFRLDLSDEEAVHYMQSLID 
ESVHALFAAWEQIHKPAQYWRK 


6150 


1 


1413 


RVDPRVRBNGTANPIKNGKTSPASKDQRTGKKTSVQGQVQKGND 
ESESDFESDPPSPKSSEEEEQDDEEVLQGEQGDPNDDDTEPENL 
GHRPLLMDSEDBEEEEKHSSDSDYEQAKAKYSDMSSVYRDRSGS 
GPTQDIiNTIJjLTSAQLSSDVAVETPKQEFDVFGAVPFFAVRAQO 
PQQEKNEKNLPQHRFPAAGLEQEEPDVFTKAPFSKKVHVQECHA 
VGPEAHTIPGYPKSVDVPGSTPFQPFLTSTSKSESNEDLFGIiVP 
FDEITGSQQQKVKQRSLOKLSSRQRRTKQDMSKSNGKRHHGTPT 

stfoctlkptyrtperarrhkkvgrrdsqssnbfltisdskenis 

VALTDGKDRGNVIiQPEESIiLJDPFGAKPFHSPDVLSWHPPXhQGL 
S \D IRflDHNT\VLPGR \ PRQNSLHGSFHSADVLKMDDFGAVP / P 
LTELWQSITPHQSQQSQPV\ELDPFGAAPFPSKQ 


6151 


372 


37 


MSNIKKYIIDYDWKAS^ErEIDHDVMTEEKLHQINNFWSDSEYR"" 
LNKHGSVLNAVLIMLAQHALLIAISSDLNAYGWCEFDWNDGNG 
QEGWPPMDGSEGIRITDIDTSGI F 


6152 


1555 


521' 


DSNQQSVSGTAASTLLHSFKATIYYQGTGHVQQFYGVTSPYSQT"' 
TPPIVQSYAQPSIjQYIQGCXilFTAHPnGrvwnpanTiA/rpn. r i/a nn 

QPQPLQPSBMWTNNIiLDXiPPPSPPKPKTIVLPPNWKTARDPEG 
KIYYYHVITRQTQWDPPTWESPGDDASLEHEAEMDLGTPTYDEN 
PMK\ASKKPKTAEADTSSELAKKSKEVFRKEMSQPXVQCLNPYR 
KPDCKVG\RITTTEDPKHIARKLTHGVMNKBLKYCKNPE\DLEC 
NENVKHKTKEYIKKYMQKFGAVYKPKEDTEFRVTVGPGWEDGWS 
GKTDSRBRKSCGPFCSTPVSTVLLMIHHPGEFNPADVN 


6153 


1366 


648 


NRTW5TPSTWMGVALPPLCSTGPWPVTRQITARTTCGAVPAKCP 

PWC/DVHEPRCQPPDCHGHGTCVDGHCQCTGHFWRGPGCDELDC 

GPSNCSQHGLCTETGCRCDAGWTGSNCSEECPLGWHGPGCQRPC 

KCEHHCPC33PKTGNCSVSRVKQCL0PPEATLRAGELSFFTRTAW 

LALTLAIAFLLLISTAANLSLLLSRAERNRRLHGDYAYHPLQEM 
NGEPLAAEKEQPG6AHNPFKD 




2 


3368 

] 

^ 


GRVGARSPGRAYAliLLLLICFNVGSGLHLQVLSTRNENiCLLPKH 

PHLVRQKRAWITAPVALLEGBDLSKKNPIAKIHSDIiAEERQLKI 

XYKYTGKGITEPPFGIFVFNKDTGELNVTSILDREETPFFLLTG 

YAIjDARGNNVEKPIjELRIKVX,DrNDNEPVFTQDVFVGSVEELSA 

AHTLVMKINATDADEPNTLWSKISYRIVSLEPAYPPVFYLNKDT 

GEXYTTSVTLDREEHSSYTLTVEARDGNGEVTDKPVKQAQVQIR 

rLDVWDNIPWENKVLEGMVEBNQVNVEVTRIKVFDADBIGSDN 

WLANFTPASGNEGGYFHIETDAQTNEGIVTLIKEVDYEEMKNLD 

PSVIVANKAAFHKSlRSKYKPTPIPIKVKVKNVKEGrHFKSSVI 

SIYVSESMDRSSKGQIIGNFQAFDEDTGLPAHARYVKXEDRDNW 

ISVDSVTSEIKLAKLPDFESRYVQNGTYTVKIVAISEDYPRKTI 

TGTVLIKTVED INDNC PTLIE P VQTl CHDASYVNVTAEDLDGH PN 

SGPFSFSVIDKPPGMAEKWKIARQESTSVLLQQSEKKLGRSEIQ 

PLISDNQ6FSCPEKQVLTLTVCEVLHGS\GCREAQHDSYVGLGP 

AAIALMILAPLLLLLVPLLLLMCHCGKGAKGFTPIPGTIEMLHP 

MNNEGAPPEDKWPSPLPVDQGGSLVGRNGVGGMAKEATMKGflS 

SASIVKGQHEMSEMDGRWEEHRSLIiSGRATQFTGATGAlXMTTB 

rriTARATGASRDVAGAQAAAVALNEEFLKNYFTDKAASYTEED 

BNHTAKDCLLVYSQEETESLNASrGCCSFIEGELDDRPr,DDLGL 

KFKTIiAEVCLGQKIDINKEIEQRQKPATETSMNTASHSLCEQTM 

tWSENTYSSGSSFPVPKSLQEANAEKVTQEIVTERSVSSRQAQK 

/ATPLPDPMASRNVIATETSYVTGSTMPPTTVILGPSQPQSLIV 

PERVYAPASTLVDQPYANEGTVWTERVIQPHGGGSNPLEGTQH 
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SHQ 
XD 
NO: 



S154 



Predicted " 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



3660 



Predicted end 
nucleotide 
location 
CO r re s pondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 



2146 



6155 



869 



6156 



5725 



6157 



946 



6158" 



S159 



53 



14 82 



84 



€160 



1626 



1790 



Ammo acid segment containing signal peptide 
(AoAlanine, C=Cy3teine, D=Aspartic Acid, E=: 
Glutamic Acid, F« Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Prolin6, Q=Glutamine, R-Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, X^Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\=pos3ibae nucleotide insertion) 
liQDVPyVMVRERSSFriAPSSGVQPTLAMPNlAVGQWVTVTSRVL 

AP AS TT /^SSVO T DTTJNTC MT a TSM'n'm ro ^ iv t*-.^ « 



APASTLQSSYQIPTENSMTARNTTVSGAGVPGPLPDKSLESSGH 
SNSTITTSSTRVTKH5TV QHSYS 

KKKTKMKNTLQKTVNFGAWPKPTI SDKSHLLQM VSKIJ>LTDAKN 
SDTAHIKSIEITSILNGLQASESSAEDSEQEDERGAQDMDNNGK 
EESKIDHLTNNRNDLISKEEQNSSSLLEEKKVHADLVISKPVSK 
SPERLRKDIEVLSEDTDYEEDEVTKKRKDVKKDTTDKSSKPQIK 
RGKRRYCNTEECLKTGSPGKKEBKAKNKESLCMEWSSNSSSDED 
EEETKAKMTPTKKYNGIiEEKRKSLRTTGFYSGFSKVAEKRIKLL 
NNSDERLQNSRAKDRKDVWSSIQGQWPKKTIiKELFSDSDTEAAA 
SPPHPAPEEGVAEESLQTVAEEKSCSPSVELEKPPPVNVDSKPI 
EEKTVEVNDRKABPPSSGSNPSA*IPLPyLHLNRLHQSL*QKGS 
RQQSSVTVSEPLAPNQEEVRSIKSETDSTIEVDSVAGELQDLQS 
ERE* LASRF* CQCELEQ* * S ARTRTS * KSLYRSBKSERCSGRRK 
FIKKABPaCP*3N3GKQQKEGK 



HLI,PELRGK$WITWKYVFyLGVIAGTFFFADSSVQKEDPAPYt\7— 
YLXSHFNPCVOVI,rKPSWVLAPAHCYLPNLKVMLGNFKSRVRDG 



^lE 



- * xi^wtvB-^^j. v^-ui^i^ijijwamfiJSSGRHPDLRQNIiEAPVMSD 
CQKTEQGKSHRWSLCVKFVKVPSRlFGEVAVATVrOCDKLQG: 
VGHFMGGDVGIYTNVYKYV3WIENTAKDK 

gtstvtkatkkhfsiilnlxksmllkkdn qdtrkllmtwalevav I 

VMKKSETYAPtFCLPSFHKFCKGIiLADTLVEDyNICLQACSSLH 
ALSS SLPDDLLQRCVDVCRVQL VHRGTCIRQAFGKLLKS I PLG V 
PLSNNNHTEIQEISLALRSHMSKAPSNTFHPQDFSD/VISFILY 
GNSHRTGKOKWLERLFYSCQRLDKRDQSTIERNLLKTDAVLWQW ! 
AIWEAAQFTVLSKLRTPLGRAQDTFQTIEGIIRSLAGHTLMPDQ i 
DVSQWTTADKDEGHGNNQLRLVLLLQYIiENLEKLMyNAYEGCAN \ 
ALTSPPKVIRTFLYTNRQTCQDWLTRIRLSIMRVGLLAGQPAVT 
VRHGFDLLTEMKTTSLSQGNELEVS IMMWEALCELHCPEAIQG 
IAVWSSSIVGKHLLWINSVAQQAEGRFEKASVEYQEHLCAMTGV 
DCCISSFDKSVLTIiASAGCKSASLKHCLNGESRKSVLSKPTDSS 
PEVINYLGNKACECYISTADWAAVQEWQNAIHDLKKSTSSTSLN 

LKADFWYIKSLSSFESGKFVECTEQLBLLPGENINLLAGGSKBK 
IDMKKLLRNM 



WANRGPSYGLSREVQEKIKQKYDADI.E NKI.VDWIILQCAEDIEH 
PPPGRAHFQKWLMDG'miCK:JINSLYPPGQEPIPKISESKMAFK 
QMEQISQFLKAAETYGVRTTDirQTVDLWEGKDMAAVQRTLMAL 
GSVAVTKDDGCYRGEPSWFHRKAQONRRGFSEEQLRQGQNVIGL 
QMGSNKGASC3AGMTGYGMPRQIM*DAASCP 



VfrioOi.>lJVV3trtJigLHtiW i-i^Y[5MPRQXM*Dj\a>SCP 

liGSLIVLSLHCKVIPSSQSLERAMKEKAVOLVPlLAQNPGIAQN 
PILEGKDHNQNTGVDPIIDHVQDRKTD/SRSKSPHKKRSKSRER 
RKSRSRSHSRDKRKDTREKIKEKERVKEKDREKEREREKEREra; 
KBRGKNKDRDKBREKDRBKDKEPCDREREREKEHEKDRDKEKEKE 
QDKEKERBKDRSKEIDEKRKKDKKSRTPPRSYNASRRSRSSSRE 
RRRRRSRSSSRSPRTSKTIKRKSSHSPSPRSRNKKDKKREKERD 
HISERRERERSTSMRKSSNDRDGKEKLEKNSTSLKEKEHNKEPD 
SSVSKEVDDKDAPRTBENKIQHNGNCQLNEENLSTKTEAV 
AVIAPLHISLGDRARPYLKNTEKSSTTCSRKKNQyFPPVMSLTH 
RLHLCKYWGGAVSNVCRFWEGRPLPLMIWPVTLPVSriPVGSCV 
IXTGTPILTFVKDPQLEVNFYTGMDBDSDIAFQFRLHFGHPAIM 
NSCVFGIWRYEEKCYYLPFEDGKPFELCIYVRHKEYKVMVNGQR 

IYNFAHRFPPASVKMLQVFRDISLTRVLISD*GRCVRITAVQEF 
DVSVSCDCTTAY QPG 

AGAKgFP*F*KVADAQPTBSEKEiyNQ VKWLKDAEGILEDLOS ' 
YRGAGHEIREAIQHPADEKLQEKAWGAWPLVGKLKKFYEFSQR 
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SEQ 
ID 
HO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A«Alanine, C=Cysteine, D<=Aspartic Acid, E=* 
Glutamic Acid, F-Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L=Leucine, M^Methionine, N«Asparagine, 
PssProline, Q=Glu tannine, R=.Arginine, 
S=Serine, T=:Threonine , V=Valine, 
WisTryptophan^ Y=Tyrosine, X=unknown, *=Sfcop 
Codon, /spossible n^ucleotide deletion, 
\sspossible nucleotide insertion) 








IiEAALRGXjLOALTSTPYSPTQHLEREQAIjAKQFAEIIjHFTLRFD 
ELKMTNPAIQNDFS YYRRTLSRMR INNVFAEGENEVWNELANRM 
S LFYAEAT PMLKTLS DATTKFVSENKNLP I ENTTDCLSTMASVC 
RVMIiETPEYRSRFTNEETVS FCLRVMVGVI ILYDKVHPVGAFAK 
TSKI DMKGCI KVLKDQP PNSVEGLLNALRYTTKHLNDETTSKQI 
KSMLQ*QLLTLVNKG 


6161 


455 


1569 


PVSGSES SLRRAWAS I LRLMLGPRVAVS ILCEDGISH* LLEKH* 
KSHVLEPLSSIiALEEQCIiALSLDWSTGKTGRAGDQPLKirSSDS 
TGQLHLLMVNETRPRLQKVASWQAHQFEAWIAAPNYWHPEIVYS 
GGDDGLLRGWDTRVPGKFLFTSKRHTMGVCS I QSSPHREHILAT 
GSYDEHILLWDTRNMKQPLADTPVQGGVWRIKWHPFHHHIiIiLAA 
CMHSGFKILNCQKAMEERQEATVLTSHTLPDSLVYGADWSWLLF 
RSLQRAPSWS FPSNLGTKTADLKGAS ELPTPCHECREDWDGEGH 
ARPQSGMKPLTEGMRKNGTWLQATAATTRDCGVNPEEADSAFSL 
LATCS FYDHALHLWE WEGN 


" 61^2 


1 


586 


RTIHATGRAGASPMHRLIVWRIiAEANKQHVRCQKCIiEFGHWTYE 
CTGKRKYIiHRPSRTAELKKALKEKENHLIjLQQSIGETNVERKAK 
KKRSKSVTSSSSSSSDSSASDSSSESEETSTSSSSEDSDTDESS 
SSSSSSASSTTSSSSSDSDSDSSSSSKQ*HQHR*OL*R*TTKEE 
EKEIELLHSYWTDGLKTLM 


6153 


1081 


785 


R IRSTTBGCTVVRIiHPTQNTQECARIMI LIjSVS LGRHWAFTYKFFL 

tpwfvffffffhrke * vmqknpmksredewmeklnnlhvqrad ! 
mnrli mnyijVTegfkeaaekfrmesg ibpsvdijEtlderi ki re 

MILKGQIQEAIALINSLHPELIjDTNRYLYFHLQQQHLIELIRQR 

etkaalefaqtqiiaeqgeesrecltemertiiaiiiapdspeespf 
gdllhtmqrqkviflsevnqavldyenrestpkiakllklllwaqn 
eldqkkvkypkmtdlskgvieepk 


6164 






PCQS PGRSRMRQDKIiTGSLRRGGRCLKRQQGGVGTI LSNVLKKR 
SCISRTAPRLLCTLEPGVDTKIjKFTLEPSliGQNGFQQWYDALKA 
VARLSTGI PKEWRRKVWLTIiADHYLHS lAIDWDKTMRFT FNERS 
NPDDDSM6IQ I VKDLHRTGCS SYCGQEAEQDRWliKRVLIAYAIi 
WNKTVGYCQGFNIIiAALI LEVMEGNEGDALKIM I YL IDKVLPES 
YFVim.RALSVDMAVPRDIJ^MKLPELSOHLDTLQRTANKESGG 
GYEPPLTNVPTMQWPLTLFATCLPNQTVLKIWDSVFFEGSE I IL 
RVSIAIWAKLGBQIECCETADEFYSTMGRLTQEMLENDLIiQSHE 
LMQTVYSMAPFPFPQrAELREKYTYNirPFPATVKPTSVSGRHS 
KARDSDEENDPDDEDAWNAVGCIiGPFSGFLAPEliQKYQKQIKE 
PNEEQ SliRSNNIAELSPGAINSCRSE YHAAFNSMMMERMTTD IN 
ALKRQYSRIKKKQQQQVHQVYIRADKGPVTSILPSQVNSSPVIW 
HLLIiGKKMKMraRAAKmVIHIPGHTGGKISPVPYBDLKTKLIirs 
PWRTHIRVHKKNMPRTKSHPGCGDTVGLIDEQNEASKTNTGLGAA 
EAFPSGCTATAGREGS S PEGSTRRTlEGQS PEPVFGDADVDVSA 
VQAKLGAI^LNQRDAAAETELRVHPPCQRHCPEPPSAPEENKAT 
S KAPQGSNSKTPI FSP FPSVKPLRKSATARNLGLYGPTERTPTV 
HFPQMSRSFS KPGGGNSGP * KMVFSSGTMLSRQLPGYPQEYQRN 
GGERFG 


6165 


90 


406 


PCQS PGRSRMRQDKLTGSLRRGGRCLKRQGGGVGTI LSN VLKKg~ 

SCISRTAPRLLCTLEPGVDTKLKFTLEPSLGQWGFQQWYDALKA 

VARLSTGI PKSWRRKVWLTLADHYUIS I AIDWDKTMRPTPNERS 

NPDDDSMGIQIVKDLHRTGCSSYCGQEAEQDRWLKRVLLAYAR 

WNKrVGYCQGFC?ILAALrLEVMEGNEGrALKIMIYLIDBCVI,PES 

YFVNNLRALSVDMAVFRDLLRMKLPELSQHLDTLQRTANKESGG 

GYEPPLTNVFTMQWFLTLFATCLPNQXVLKIWDSVFFEGSEIIL 

RVSIiAIWAKLGEQIECCETAEEFYSTMGRLTQEMLENDLLQSHE 

LMQTVySMAPFPFPQI»AELREKYTYNITPFPATVKPTSVSGRHS 

KARDSDEENDPDDEDAWNAVGCLGPFSGFIAPELQKYQKQIKE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
tA=Alanine, C^Cysteine, D=Aspartic Acid, E= ' 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L= Leucine, M=*Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R^Arginine, 
S=Se rine, T=Threonine, VisValine, 
W^Tryptophan, Y=Tyrosine, X«Unknown, *=Stop 
Codon, /s=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PNEEQSIiRSNNIAELSPGAINSCRSEYHAAFNSMMMERMTTDIN 
ALKRQYSRIKKKQQQQVHQVYIRADKGPVTSI LPSQVNSS PVIN 
HLLLGKKMKMTNRAAKNAVIHiPGHTGGKISPVPYEXiLKTKriWS 
PWRTHXRVHKKNMPRTKSHPGCGDTVGLIDEQtJEASKTHGLGAA 
EAFPSGCTATAGREGSSPEGSTRRTIEGQSPEPVFGDADVDVSA 
VQAKLGALELNQRDAAAETELRVHPPCQRHCPEPPSAPEENKAT 

HFPQMSRSPSKPGGGNSGP * KMVFSSGTMLSRQLPGypQEYQRN 
GGERFG 


6166 


2 


1206 


HKLWRTVAMAGAEWKSLEECLEKHLPLPDLQEVKRVLyGKELRK 
LDLPREAFEAASREDPELQGYAFEAAEEQLRRPRIVHVGLVQNR 
IPLPANAPVAEQVSALHRRIKAIVEVAAMCGVNIICFQEAWTMP 
FAFCTREKLPWTEFAESABDGPTTRFCQKIiAKNHDM\AArSPILE 
RDSEHGDVLWNTAWISWSGAVLGKTRKNHIPRVGDFNESTYYM 
EGNLGHPVFQTQFGRIAVNICYGRHHPLNWLM YS INGAE II FNP 

GDGKKAHQDFGYFYGSS YVAAPDSS RTPGLSRSRDGLLVAKLDL 

NLCQQVNDVWNFKMTGRYEMYARELAEAVKSNYSPTIVKE+PAS 
VPALG 


6167 " 


1220 


1844 


YGIVTGPSLCAGDKQPKKQEKNPVLVSPEFVDEALCACEEYLSN 
t*AEMD IDKDLEAPLYLTPEGWSLFLQRYYQWHEGAELRHIiDTQ 
VQRCEDI LQQLQAWPQI DMEGDRNI WI VKPGAKSRGRG IMCMD 
HLEEMIiKLVNGNPWMKDGKMWQKy I BRPLLI FGTKFDLRQWF 
LVTDWWPLTVWPYRDSYIRFSTQPPSLKNLDK*APLYLTPEGWS 
LPLQRYYQWHEGAELRHLDTQVQRCEDILQQLOAVVPQIDMEG 

VQKYI ERPLLI FGTKFDIiRQWFLVTDWNPlUTVWFYRDS YI RPST 
QPFSLKNIiDK 


6168 


84 


1392 


VWPVPSVSAMPPKKQAQAGGSKKAEQKKKEKIIBDKTFGLKNKK 
GAKQQKFIKAVTHQVKFGQQNPRQVAQSEAEKKLKKDDKKKELQ 

RT.MWT.Try P Wi 2V rtlf T ^\cn TV n o btg xrxrr* » cc vru^/^ r"r*vr^ fi rrr^r^ryfy t t 
ciML^Bjjf *v.ir V vrt^Kyix.xoAijMJUt'A.i V VlJ>iJi? B K^CsUL. X KGDKCKFSH 

DLTLERKCEKRS VYIDARDEELEKDTMDNWDE KKLEEWNKKHG 
EAEKKKPKTQIVCKHFLEAIBNNKYGWPWVCPGGGDICMYRKAL 
PPGFVLiCKKKKKKKKEDEISIi*DLIERERSALGPNVrKITLESP 
LAWKKRKRQEKIDKLEQDMERRKADFKAGKALVISGREVFBFRP 
ELVNDDDEEADDTR YTQGTG6DE VDDSVS VND I DLS U YI PRDVD 
ETGITVASLERFSrYTSDKDEWKLSEASGGRAENGBRSDLEEDKr 
EREGTENGAIDAVPVDENLFTGEDLDEIiEEELNTLDLEE 


6169 


112 


662 


APAAAMAERPEDLNLPNAVITRI I KEALPDGVNIS KEARSAISR 
AASVPVLYATSCANNFAMKGKRKTLNASDVIiSAMEBMEFQRFVT 
PLKEAIiEAYRREQKGKKEASEQKKKDKDKKTDSEEQDKSRDEDN 

DEDEBRLEESEQNEEEEVDN*KGRETVAPWrKVPLEMRRATCFCE 
AFPCWAE 


6170 


62 


667 


STKVMLPNTGRLAGCTVFITGASRGIGKAIALKAAKDGANIVIA " 
AFCTAQPHPKLLGTI YTAAEE lEAVGGKALPCI VDVRDEQQISAA 
VEKAIKICPGGIDILVNWASAISLTNTLDTPTKKLDt/MMNVNTRG 
TYLASKACIPYLKKSKVAHIPNISPPLNLNPVWFKQHCGRW*W 
G * GDGLCLICFELNIiCMSDV ITI CT 


6171 


382 


941 


HFMQSDVELDCDIEPCGHTKFPPTLPLSTTVIVCSCHPVATAST 
MAEAFSKTTSEEDQSIQEPKEANSMTAQKQKK*GLRGSRRRHAN 
SGGDI FGDSFAAYFPRVLKQVHQALSLSQEAVSVMDSMVRDI LD 
RIATEAGHLAHYSXCVTITSRDIRMAVCtiLLPGKMGKLAESQGT 
NATLRYTKSK 


6172 


651 


54 


GLCRAGGAHRFSRTHVEAALKMLRREARIiRREYLYRKAREEAQR " 
SAQERKERLRRAIiEENRLI PTELRREALALQGSIiEFDDAGGEGV 
TSHVDDEYRWAGVEDPKVMITTSRDPSSRLKMFAKELKLVFPGA 
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SEQ 
ID 
NO: 


Predicted 
beginnincf 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence ' 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide"" 
(A=Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F»Phenyl alanine^ G=Glycine 
H^^Histidine, I=Isoleucine, K=Lysine, 
L-Leucine, M=Methionine, N^Asparagine, 
P=Proline. Q=Glutamine, R=Arginine, 
S=Serine, T=: Threonine, V«Valine, 
W=Tryptophan, y=Tyrosine, X^Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\«possible nucleotide insertion) 








QRMNRGRHEVGALVRACKANGVTDLLWHEHRGTPVGLIVSHLP 
FGPTAYFTIjCNWMRHDIPDLGTMSEAKPHLITHGFSSRLGKRV 
SDILRYIiFPVPKDDSHRVITFANQDDYISFRHHVYKKTDHRNVE 
LTEVGPRFELKLYMI RLGTLEQEATADVE WRWHPYTNTARKRVF 
LSTE*AAPRPLGQLL 


6173 


3 


288 


SVDHREVQVLSQSMPLTPHQAVLRGERPYMCVECGKCFGRSSHIi 
LQHQRIHTGEKPYVCSVCGKAFSQSSVLSKHRTIHTGEKPYECN 
ECGKAFRVSSDLAQHHKIHTGEKPHECLECRKAFTQLSHLIQHQ 
RIHTGERPYVCPLCGKAFNHSTVLRSHQRVHTGEKPHRCNECGK 
TFSVKRTLLQHQRIHTGSKPYTCSECGKAPSDRSVLIQHHNVHT 
GEKPYECSECGICrFSHRSTLMNHERIHTEEKPYACYECGKAFVQ 
HSHIiIQHQKVHRKL* PTCVLS VGSAIAGVPTS PS IS VSTLERS P 
MCAVYVGRPSARAQSLVNTGQFTQVRSPMSVMSVEKPLE 


6174 


1060 


959 


PRPPQKRWMVAGLGNPGLPGTRHSVGMAVLGQLARRLGVAESWT 
RDRHCAADLALAPLGDAQLVLLRPRRLMNANGRSV/OiAAELPGL 
TAEEVYLVHDELDKPLGRLxALKLGGSARGHNGVRSCISCLNSNA 
MPRLRVGIGRPAHPEAVQAHVXGCFSPAEQELLPLLLDRATDLI 
LDHIRERSQGPSIjGP*H*WFSKKA 


6175 


2204 


334 


ryfradprsrsgqpraisglgafaegplramaapvkgnrkqsteg " 
daldppaspkpagkqngiqnpisledspeaggereeeqereeeq 

APLVS tiY KFMKERHT P I ERVPHLGFKQINLWKI YKAVEKLGAY E 
LVTGRRIiWiCKVyNELGGSPGSTSGATCTRRHY*RLVLPYV3mLK 
GBDDKP]:lPTSKPRKQYK^aAKEISrRGDDGATBRPKICAKEERRMDQM 
MPGKTKADAADPAPLPSQEPPRNSTEQQGLASGSSVSPVGASGC 
PEAYKRLLSSPYCKGTHGIMSPLAKKKLLAQVSKVEALQOQEEQ 
*f i^A vmji^ FU b PKGLi TENSRHRLTPQEGLQAPGG 
SLREEAQAGPCPAAPIFKGCPYTHPTEVLKPVSQHPRDFFSRLK 
DGVLLG PPGKEGLS VKEPQLVWGGDANRPSAFHKGGSRKGI LYP 
KPKACWVSPMAKVPAESPTIiPPTFPSSPGLGSKRSLEEEGAAHS 
GKRIiRAVSP FLKEADAKKCGAKP AGSGtiVQ rt.T .«pa T /imrD d t? a 

YRGTMLHCPLNFTGTPGPLKGQTJVLPFSPLVIPAFPAHPIATAG 
PSPMAAGLMHFPPTSFDSALRHRLCPASSAWHAPPVTTYAAPHF 
FHLNTKL 


6176 


1040 


402 


PLSALRAMAEVHVIGQI IGASGFSESSLFCKWGIHTGAAWKLLS 
GVREGQTQVDTPQIGDMAYWSHPIDZJIFATKGLQGWPRLHFQVW 
SQDSFGRCQIiAGYGFCHVPSSPGTHQIiACPTWRPLGSWREQLAR 
AFVGGGPQLLHGDTIYSGADRYRLHTAAGGTVHLEIGLJ0L.R1TFD 
RyGVEC*GTLPPTSPPSTPRTPSDGGGWHSGQEHRL 


6177 " 


1400 


992 


VPIESLVGiCVHNFPLIAFYCCSKGKRQPHKSLHDRCFGEALDPN 
CSHCYLDQI KRSDFLGFSGYS PHFVAISTMSEHKMQPSSMQQAL 
PSQ*PYWTDPRPAIiVPCCSHRPDVHRSRPGPGLPGTSGCSDRPP 
VCPI 


6178 


1027 


254 


STQRGG I KGVARAAS LVGRRRAGTGMALLLCLVCLTAAIiAHG CL 
HCHSNFSKKFSFYRHHVNFKSWWGDIPVSGALLTDWSDDTMKE 
LHLAIPAKITRBKLDQVATAVYQMMDQLYQGKMYFPGYFPKELR 
NI FRSQVHL IQNAI I ESRIDCQHRCGI POYETISCNNCTDSHVA 
CFGYNCESSAQWKSAVQG^,LtIYINNWHKQDTS^5RPRSSAFSWPG 
THRAAPAFIjVLPALRCriEPPHliANLSLEDAA*CLKQH 


6179" " 


806 


276 


RGETREMAGNLLSGAGRRLWDWVPIACRSFSLGVPRLIGIRLTL 
PPPKWDRWIJEKRAMFGVYDNI GI LGNFEKHPKELIRGPI WLRG 
WKGNELQRCIRKRKMVGSRMFADDLHNLNKRIRYLYKHFNRHGK 
FR*KRKLRTSEKAHLS PWRRETVLPPVRKRLCI FS VI KWGFFGI 


6180 


156 


1833 


DHHILKAASTTHVCARGNIFAIPNTRCLEC*ATATPSSLECQN* 
SHLSLCPLPATTSGLTPNSMIPEKERQNIAERLLRVMCADLGAL 
SWSGKEPLKLAQTLVDSGARYGAFSVTEILGNFNTLALKHLPR 
MYNQVKVKVTCALGSNACLGIGVTCHSQSVGPDSCYILTAYQAE 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

cor re spon di ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A=Alanine, CoCysteine, D=Aspartic Acid, E~ 
Glutamic Acid^ F= Phenylalanine, G^Glycine, 
H=Histidine^ Islsoleucine, K=:Lysine, 
Ii=Leucine^ M=Methionine, N-Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, X^Unknown, *=.Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








GNHI KS YVLG VKGAD I RDSGDLVHH WVQNVLSEFVMSE IRTVYV 
TDCRVSTSAFSKAGMCrjRCSACALNSWQSVLSKRTLQARSMHE 
VIELLKVCEDLAGSTGLAKBTFGS LEETS PP PCWNS VTDSLI.LV 

LSKESQPTLQLVZjPTYVRLBKLFTAKANDAGTVSKLCHLPLEAL 
KEITFKVHPAHKVAMILDPQQKLRPVPPYQHEEIIGKVCELINEV 
KESWAEEADFEPAAKKPRSAAVENPAAQEDDRLGKNEVYDYLQE 
PLFQATPDIiFQYWSCVTQKHTKLAKLAFWLliAVPAVGARSGCVN 
MCEOALLI KRRRLLSPEDMNKLMFLKSWML 


6181 


169 


1032 


TRTLLSPVLLPGPRWKPV^RRRPMdPLAtPAWLQPRYRKNAYLFI 
YYLIQFCGHSWI FTNMTVRPPS FGKDSMVDTFYAIGLVMRLCQS 
VSLLEXXH I YVGIB SNHLLPRFLQLTERI 1 1 LPWITSQBBVQE 
KYWCVLPVFVJNLLDMVRYTYSMLSVIGISYAVLTWLSQTLWMP 
IYPLCVriAEAPAXYOSliPY"PE^"F?f;TV^TTfT,PTmT.^TVTi»'DVTrr vt 

YLMMLFIGM YFTYSHLYSERRDI LGI FP IKKKKM* STAFQCDTR 
KDRLW1QC5K+NTGSILVEKFLVF 


6182 


1769 


1224 


AS*IDYQI*NTLL>KEFQLTEENTKLRYLTCiSI,IEDMAAAYFPDCI " 
VRPFGSSVNTFGKLGCDLDMPLDLDETRNLSAHKISGNFLMEPQ 
VKNVPSERIATQKXLSVLGECLDIIFGPGCVGVQKIIiNARCPIjVR 
FSHOASGPOCDLTTNNRIAIjTfiSEIjIjVTYf5aT.rn3RVPaT.vpCTrD 

CWARAHSLTSSIPOAWITNFSrjTMMVIFFLQRRSPPILPTIiDSIi 
KTLADAEDKCVlEGNWCTFVRDLSRIKPSQNTETLELliLKEFFE 
YFGNFAFDKNSXNIRQGREQNKPDSSPLYIQNPFETSLNISKNV 
SQSQLQKFVDIiARES AW I LQQEDTDRPS ISSNRPWGLVS LLLPS 
APNRKSFTKKKSNKFAIETVKNLLESLKGNRTENFTKTSGKRTI 
STQT 


6183 


1118 


452 


HLDRYIKSPGSGSSTPAPP3HLLLYLLHPQSTRTMGCCGCSRGC 
GSGCGGCGSS CGGCGSGCGGOGSGRGGCGSGCGGCSSS CGGCGS 
RCYVPVCCCKP VCSWVPACSCTSCGS CGGSKGGCGSCGGS KGGC 
GSCGCSQSSCCKPCCCSSGCGSSCCQSSCCKPCCCQSSCCVPVC 
CQSSCCKPCCCQSNCCVPVCCQCKI*GSGPRPSGFSCLVKAFLM 
VP 


6184 


1 


2191 


IVTVREEDGAPAVAPPGVWSRANKRSGAGPGGSGGGGARGAEE 
EPPPPLQAVLVADSFDRRFFPISKDQPRVLLPLANVAIilDYTLE 
FLTATQVQETFVFCCWKAAQIKEHLLKSKWCRPTSLNWRIITS 
EIiYRSLGDVLRDVDAKALVRSDFLLVYGDVISNINITRALEEHR 

HFQKTQGLRRFAFPLSLFQGSSDGVEVRYDLLDCHISICSPQVA 
'QLFTDNPDYQTRDDFVRGLLVNEEILGNQIHMHVTAKEYGARVS 
NLHMY SAVCADVI RRWVYPLTPEANPTDSTTQSCTHSRHNI YRG 
PEVSLOHGSILEEMVIULGSGTVIGSNCPITNSVIGPGCHIEPGD 
NWLDQTYLWQGVRVAAGAQIHQSIaLCDNAEVKERVTLKPRSVL 
TSQVWGPNI TL P EGS VI SLH PPDAE EDBDDGE FS DDSGADQE K 
DKVKMKGYNPABVGAAGKGYLWKAAGMNMEEEEELQQNLSWSIiKI 
NMEEESESESEQSMDSEEPDSRGGSPQMDDIKVFQNEVLGTLQR 
GKEENISCDWLVLEINSLKYAYNISLKEVMQVLSHWLEFPLQQ 
MDSPLDSSRYCALLLPLLKAWS PVFRNYI KRAADHLEALAAI ED 
PPLEHEALGISMAKVLMAFYQLEILAEETILSWFSQRDTTDKGQ 
QLRKNQQLQRFIQWIiKEAEEESSEDD 


^185 


791 


44 


PCTS CVLWATLHLPASTRKAPQAECGMI S I TEWQKI GVGITC FG 
IFFILFGTLLYFDSVLIiAFGNLLFLTGLSLIIGLRKTFWFFFQR 
HKLKGTSPI*LGGWIVI>LRWPLLGMFLETYGPPS*LFKGFFPVAF 
GFLGNVCN I PPLGALFRRLQGTS SMV* KTEMSSLNLDHWLKGAK 
RBBWEPP PQS PALTHS PTYPGPPQVQKERNGAEQLTSNPQVDSR 
GCQEAEMQTPRRLGWGWYHTLTLYIiWEEK 


6186 " 


5^9 


238 


VYGIDSSNTNTHGAEERNRKLKKHWKLCHAQSRLDVNGLALKMA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


i'redicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acxd segment containing signal peptide 
(A-Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F«Phenyl alanine, G^Glycine, 
H^Histidine, I=Isoleucine, K=i/sine, 
LsiLeucine, M=Methionine, N=Asparagine, 
P«Proline, Q=Glutamlne, R=Arginine, 
SssSerine, T=Threonine, VssValine, 
WsTryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
_ X^possibla nucleotide insertion) 


6187 






KERKVKNKVKNKADTEEVFNNSPTNQEKMPTSAILPD'SGSVTS 
WIRNQMETtiHSQPHQEENt,CFENSFSLlNLl.PINAVE^TSSQQr 
PNRETSEANKBRRKMTSKSSESNIYSPLTSFITADSELHDIIKD 
LEDCLMVGLHTCGDLAPNTLRIPTSNSEIKbVCSVGCCYHLLSE 
EPENQHKERTQEKWGPPMCHYLKEERWCCGRNARMSACIALERV 
AAGQGLPTESLFVRAVLQDIIKDCYGITKCDRHVGKIYSKCSSP 
LDYVRRSLKKIiGLDESKLPBKI iMNyYEKYKPRMNELKAPNMLK 
WLAPCIETLILLDRLCYLKEQEDIAWSALVKLFDPVKSPRCYA 
VIALKKOQ* FPLKQI I RCISL * DSAGCAEE VSVGDGGPALRDAP 
PSGSRVGSRYD 




1701 


771 


DAWGPETRLARILNPDSFIEPRPGRLPELEATRPHMEPKASCPA 

AAPLMERKPHVLVGVTGSVAALKbPLLVSKLLDIPGLEVAWTT 

ERAKHFYSPQDIPVTLYSDADEMEMWKSRSDPmilDIjRRWADL 

LLVAPLDANTLGKVASGICDmiLTCVMRAV/DRSKPLLFCPAMNT 

AMWEHPITAQQVDQLKAFGYVEIPCVAKKLVCGDEGLGAMAEVG 

TIVDECVKEVLFQHSGFQQS*PGISVMGVPLYSEWVQAKSVKMDV 

GKIGGYPHbLNGGPALSLPRGQACSRLNWTEGPGIjSFFQPGEAA 
A 


6188 
€189 


238 


1534 


KGFVNAGPI*MAELQVSPQWKAPBMSQICLSCGHPSA*GPRWASW 
NIGVPICIRCAGIHRNLGVHISRVKSVNLDQWTQBQIQCMQEMG 
NGKANRLYEAYLPETFRRPQIDPAVEGFIRDKYEKKKYMDRSLD 
INAPRKEKDDKWKRQSEPVPEKKLEPWFEKVKMPQKKEDPQLP 
RKSSPKSTAPVMDLLGLDAPVACSIANSKTSNTLEKDLDLLASV 
PS PSSSGSRKWGSMPTAGSAGS VPSNliNIiFPBPGSKS EE IGKK 
QIiSKDSIIiSLYGSQTPQMPTQAMFMAPAOMAYPTAYPSFPGVTP 
PNSIMGSMMPPPVGMVAQPGASGMVAPMAMPAGYMGGMQASMMG 
VPNGMMTTQQAGYMAGMAAMPQTVYGVQPAQQLQWNLTQMTQQM 
AGMNFYGAKGMMNYGQSMSGGNEQAANQTLSPQMWK 


6190 


1297 


793 


LGEPLiGDLCEIilPGDVQQLQMGEVHPGTGAQGSAAQSVAGEVQL 
TQLSHARQRPSCQQSQLIALDLQHMDISRQPRWQHVQPVARQVQ 
RAQQAQLAEGVAVHLWAGDAWAEVELLQEVGGGKVFAANACDL 

wqdheg;^iaarqatghalqrvivqvrrvqpleal*rvpsglpr 
rvrapmilhwqitgigredfattyfleelniisynritspqvhrd 

AFRKLRLLRSIJJLSGNRLHMLPPGLPRNVHVLKTKRNELAAIAR 
GALAGMAQLRELYIiTSNRljRSRALGPRAWVDLAHIiQLLDIAGNQ 
LTEIPEGLPESLEYLYLQNNKISAVPANAFDSTPWLKGIPLRFN 
KLAVGSWDSAFRRLKHLQVLDIBGNLEFQDISKDRGRLGKEiCE 
EEEEDEVEEEETR 


6191 


66 


1 AO 


II;VGir\^SFLLSFAEyVCNCSWGSLJJVNRCNQTTGQCECRPGYQ 

glhcetckegfy:*nytsglcqpcdcsphgalsipckssgkcqck 
VGVIGS icdrcqdgyygfskngclpcqcnnrsascdaltgacln 
oqenskgnhceeckegfyqspdatkeclrcpcsavtstgscsik 

SSELEPECDQCKDGYIGPNCNKCENGYYNFDSICRKCQCHGHVY 
P VKTPK I CKPE SGE CINCLHNTTG PWCENCIj* G YVKDt .urswr^r v 

KVILPTPEGS TILVSNASLTTS VPTPVINSTFTPTTLQTI FS VS 
TSENSTSALADVS WTQFNI 1 1 LTVI II VWLLMGFVGAVYMYRE 
YQNRKLNAPFWTIELKEDNISFSS YHDS I PNADVSGLLEDDGNE 
VAPNGQLTLTTPIHNYKA 


6192 


1212 


1511 


VNLCHGGLLHLSTHHI/3IKPSMH*LFPLMLSFPHLTPQQPKCPS"~ 

MIDWIKKIWYIYTMEYYATIKRNEIMFPAGTWMEMEAIILSKLM 
QDYMFSLISGS 




3 


950 

] 


rRGCGNKMAGKKWVLSSIiA\nrAEDSEPESDGEAGXEAVGSAAE^ 
fCGGLVSDAYGEDDFSRLGGOEDGYEEEEDENSRQSEDDDSETEK 
PEADDPKDNTEAEKRDPQELVASFSBRVRNMSPDEIKIPPEPPG 
^CSNHLQDKIQKLYERKIKEGMDMNYI lORKKEFRtTPSIYEKLJ 
SPCAIDELGTNYPKDMFDPHGWSBDSYYEAIAKAQKrEMDKLEK 
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S3Q 
ID 
KO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


£rxt;:uxu.uCG enci 

nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(As:Alanine, C«Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F«Phenyl alanine, G=Glycine, 
H^Histidine, Ii=Isoleucine, K=Lysine, 
L= Leucine, K=Methionine, N=Asparagine, 
P« Proline, Q=GZutamine, R==Arginine, 
S=Serine, T=Threonine, V=Valine, 
W= Tryptophan, Y=Tyrosine, X= Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








AKKERTKIEFVTGTKKGTTTNATSTTTTTASTAVADAQKRKSKW 

DSAIPVTTIAQPTILTrrATLPAWTVTTSASGSKTTVISAVGT 
IVKKAKQ 


6193 


3 


950 


TRGCGNKMAGKECNVLSSLAVyAEDSSPESDGEAGIEAVGSAAEB " 

KGGLVSDAYGEDDFSRLGGDEDGyE3BEDEUfSRQSEDI>DS3TEK 

PEADDPKDNTEAEKRDPQELVASFSERVRKMSPDEIKIPPEPPG 

RCSNHJLQDKIQKLYBRKIKBGMDMNyilQRKKEFRNPSIYEKLI 

QFCAIDELGTNYPKDMFDPHGWSEDSYYEALAKAQKIEMDKLEK 

AKKERTKIEFVTGTKKGTTTNATSTTTrTASTAVADAUKRKSKW 

DSAIPVTTIAQPTILTTTATLPAWTVTTSASGSKTTVISAVGT 

IVKKAKQ 


6194 


3 


950 


TRGCGNKMAGKKNVLS SLAVYAEDS EPESDGEAG I EAVGSAAEE 

KGGLVSDAYGEDDFSRLGGDEDGYEEEEDENSRQSEDDDSETEK 

PEADDPKDNTEABKRDPQELVASFSERVRNMS PDE IKI PPE PPG 

RCSNHLQDKIQKLYBRKIKEGMDMNYriQRKKEFRNPSIYEKLI 

QPCAIDELGTNYPKDMFDPHGWSEDSYYEALAKAQKIEMDKLEK 

AKKERrKIEFVTGTKKGTTTNATSTTTTTASTAVADAQKRKSKW 

DSAIPlOTIAQPTILTTTATLPAWTVTTSASGSKTrviSAVGT 
IVKKAKQ 


6195 


736 


235 ■ 


VANGLQSNMPKFYCDYCDTYLTHDSPSVRECTHCSGRKHKENVKb 
YYQKWMEEQAQSLIDKTTAAFQQGKIPPTPFSAPPPAGAMIPPP 
PSLPGPPRPGMMPAPHMGGPPMMPMMGPPPPGMMPVGPAPGMRP 
PMGGHMPMMPGPPMMRPPARPMMVPTRPGMTRPDR 


6196 


1512 


623 


KTGKRRSAAYVRNIIaDNAEQVISKLEARNLGPRLTPLLQEEDSlT" 

QRLLMGLMVSELKDHPLRHLQGVEKKKIEQMVLDYISKLLDLIC 

HIVETWWRKKNLHSWVLHFNSRGSAAEFAVFHIMTRILBATMSt, 

FLPLPPGFHTLHTILGVQCLPLHNLLHCIDSGVLLLTETAVIRL 

MKDLDNTEKKEKLKFSIIVRLPPLIGQKICRLWDHPMSSNIISR 

NHVTRLLQNYKKQPRNSMINKSSFSVEFLPLNYPIErLTDIESS 

NQALYPPEGHDNVDAEFVEEAALKHTAMLLGL 


6197 


3 


819 


ADPEGTESAVMSRYTRPPNTSIiFIRNVADATRPEDLRREFGRYG 
PIVDVYIPIjDFYTRRPRGFAYVQFEDVRDAEDALYNUJRKWVCG 

rqikiqfaqgdrktpgqmkskbrhpcspsdhrrsrspsqrrtrs 
rssswgrnrrrsdslkesrhrrfsysqsksrskslprrstsarq 
srtprrwfgsrgrsrskslqkrsksigksqssspqkqtssgtks 

RSHGRHSDSIARSPCKSPKGYTNFETKVQTAKHSHFRSHSRSRS 
YRHKNSW 


6198 


111 


1912 


SEAALSPSFISPACFLLRKLPALEDGTLPHPDTJLGMNYEGARSB 
RENHAADDSEGGALDMCCSERI.PGLPQPIVMEALD2AEGLQDSQ 

remppppppsppsdpaqkppprgagshsltvrsslclfaasqfl 

LACGVLWFSGYGHIWSQNATNLVSSLLTLLKQLEPTAWLDSGTV7 
GVPSLLLVFLSGGLVLVITLVWHLLRTPPEPPTPLPPEDRRQSV 
SRQPSFTYSEPJMEEKISDDFIiDLDPVPETPVFDCVMDIKPEADP 
TS liTVKSMGLQERRGSNVSLTLDMCTPG CNEEGFGYLMS PRESS 
ARE YLLSASRVLQAEELHEKALDP FliQAEFFE IPMNF VDPKE Y 
DI PGLVRKNRYKTI LPNPHSRVCLTSPDPDDPLSSYINANTYIRG 
YGGEEKVnr r ATQGPI VSTVADF WRMVMQEHTP X I VMITWI EEMN 
EKCTEYWPEEQVAYDGVEITVQKVIHTEDYRLRLISLKSGTEER 
GLKHYWFTSWPDQKTPDRAPPLLHLVREVEEAAQQEGPHCAPII 
VHCSAGIGRTGCFIATSICCQQLRQEGWDILKTTCQLRQDRGG 
MIOHCEQYQFVHHVMSIiYEKQLSHQS PE 


6199 


144 


1211 


MARENGBSSSSWKKQAEDIKKIFEFKETLGTGAFSEWLAEEKA 
TGKLFAVKCIPKKALKGKESSIENEIAVLRKIKHENIVALEDIY 
ESPNHLYLVMQLVSGGELFDRIVEKGFriEKDASTLIRQVLDAV 
YYLHRMGIVHRDLKPENLLYYSQDEESKIMISDFGLSKMEGKGD 
VMSTACGTPGYVAPEVLAQKPYSKAVDCWSIGVIAYILLCGYPP 



464 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beef i nn t m-r 

nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
Eetjuence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine^ D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, lalsoleucine, K= Lysine, 
L=Leucine, M=Methionine, W=:Asparagine, 
P=Proline, Q=Glutamine, R=»Arginine, 
S=Serine, T^Threonine, V=Valine, 
WsTryptophan, Y=:Tyroeine, X=Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=poeeible nucleotide insertion) 








tr X wc« LfZ3 AijT isy J. Ij iVAt,y iS FDS P YWDD I SDS AKD FI RNLME KDP 

NKRYTCEQAARHPWXAGDTALNKNIHESVSAQIRKNFAKSKMRQ 

AFNATAVVRHMRKLHLGSSLDSSNASVSSSriSLASQKDCflSGT^P 
HAL* 


6200 


702 


i 


L PEVPHSLRPRVKPHLCCAQPAVRVMARLPKLAVPDLDYTLWP F 
WVDTHVDPPFHKSSDGTVRDRRGQDVRLYPEVPEVLKRLQSLGV 
PGAAASRTSEIEGANQLLELFDLFRYFVHREIYPGSKITHFERL 
QQKTGIPFSQMIFFDDERRNIVDVSKLGVTCIHIQNGMNLQTLS 
QGLETFAKAQTGPLRSS LEES PPE A 


6201 


2809 


2383 


GQTPRVRWKMRRSLRAGKRRQTAGRKSkSpPKVPIVIQDDSLPA 
GPPPQIRILKRPTSWGWSSPNSTSRPTLPVKSLAQREAEYABA 
RKRILGSASPBEEQEKPILDRPTRISQPEDSRQPNNVIRQPLGP 
DGSQGFKQRR 


6202 


2 


426 


IKADRAAVASSLLSRPTRKMAPQKDRKPKRSTWRFNLDLTHPVE "■ 
DGIFDSGNFEQFLREKVKVKGKTGNLGNWHIRRFKNKITWSE 
KQFSKRYLKYLTKKYLKKNNLRDWLRWASDKETYELRYFQISQ 
DEDHSESED 


6203 


419 


2550 


RCPRPPATAGATU^RPDRS PPSG I SGSEAAAQAQAAAPASQHPA ' 

TGTGAVQTEAMKQILGVIDKKLRNLEKKKQKLDDYQBRMNKGER 

LNQDQLDAVSKYQEVTNNLEPAKELQRSFMALSQDIQKTIKKTA 

RREQLMREETiEQKRLKTVLELQYVLDKLGDDBVRTDLKQGLNGV 

PILSEEELSLLDEFYKLVDPERDMSLRLNBQYEHASIHLWDLLE 

GKEKPVCGTTYKVLKEIVERVFQSNYFDSTHNHQNGLCEE3EAA 

SAPAVEDQVPEAEPBPAEEYTEQSBVESTEYVMRQFMAETQPTS 

GEKEQVDEWTVETVEWNSLQQQPQAASPSVPEPHSLTPVAQAD 

PLVRRQRVQDLMAQMQGPYNFIQDSMLDFBNQTLDPA2VSAQPM 

NPTQNMDMPQZ.VCPPVHSESRLAQPNQVPVQPEATQVPLVSSTS 

EGYTASQPLYQPSHATEQRPQKEPI DQIQATI SLNTDQTTAS SS 

LPAASQPQVFQAGTSKPLHSSGINYNAAPFQSMQTVFNMNAPVP 

PVNEPETLK00NQYQA5yNQSFSSQPHQVEQTELQQEQI.QTWG 

TYHGSPDQSHQVTGNHQQPPQQNTGFPRSNQPYYNSRGVSRGGS 

RGARGLMNGYRGPANGFRGGYDGYRPSFSNTPWSGYTQSQFSAP 

RDYSGYQRDGYQQNPKRGSGQSGPRGAPRGRGGPPRPNRGMPQM 
NTQQVN 


6204 


2933 


787 


CTHNLISLLGGRALIHFNRFLNLKIQEGEAHNIFCPAYDCFQLV 
PGDI I KS WSKEI^DKRYLQFDIKAFVENNPAI KWCPTPGCDRAV 
RLTKQGSNTSGSDTLSFPLLRAPAVDCGKGHLFCWECLGEAHEP 
CDCQTWKNWLQKI TEMKPEELVGVS EAYEDAANCLWLLTNSKPC 
ANCKS PIQKNEGCNHMQCAKCKYDFCWI CLEEWKKHSPVHWEVI 
YRCTRYEVIQHVEEQSKEMTVEAEKKHKRFQELDRFMHYYTRFK 
NHEHSYQLEQRLLKTAKEKMEQLSRALKETEGGCPDTTFIEDAV 
HVLLKTRRILKCSYPYGFFLBPKSTKKEIFEIiMQTDLEMVTEDL 
AQKVNRPYLRTPRHKI IKAACLVQQKRQEFLAS VARGVAPADSP 
EAPRRS FAGGT WD WE YLG FAS PRRva r fo V"? pt? hp op b Dontrtj e 

LLSNPPDPDEPSESTLDIPEGGSSSRRPGTSWSSASMSVLHSS 
SLRDYTPASRSEWQDSLQALSSLDEDDPNXLLAIQLSLQESGLA 
LDEETRDFLSN2ASLGAIGTSLPSRLDSVPRNTDSPRAALSSSE 
LLELGDS LMRLGAENJDPFSTDTLSSKPLS EARS0FCP SSSDPDS 
AGQDPNINDNLLGNIMAWFHDMNPQS lALI PPATTEI SADSQLP 
CIKDGSEGVKDVELVLPEDSMFEDASVSEGRGTQIEENPLEENI 
PGGGKQHPQAW 


6205 


1 


1200 


RAHRGKMALEVGDMEDGQLSDSDSDKTVAPSDRPLQIiPKVLGGD 
SAMRAFQKTATACAPVSHYRAVESVDSSEESFSDSDDDSCLWKR 
KRQKCFNPPPKPEPPQFGQSSQKPPVAGGKKINNIWGAVLQEQN 
QDAVATELGI LGMEGTI DRSRQSETYNYLLAKKLRKESQEHTKD 
LDKBLDE YMHGGKKMGS KEEBNGQGHLKRKR PVKDRLGNRPEMW 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide" 
(AeiAlanine, C=Cysteine, D«Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L-Leucine, M^Methionins, W=:Asparagine, 
P=^Proline, Q=Glutamine, R=^Arginine, 
S=Serine, T«Threonine, V«Valine, 
W^Tryptophan, Y-Tyroaine, X-Unknown, *=:Stop 
Codon, /sposaible nucleotide deletion, 
\=pGssible nucleotide insertion) 








ykgryeitaedsqekvadbisfrlqepkkdliarwriignkka:"" 

leiilmetaeveqngglfimngsrrrtpggvflnllkntpsises 

qikdifyienqkeyenkkaarkrrtqvlgkkmkqaikslnfqed 

DDTSRETFASDTNEALASIiDESQEGHAEAKLEAEEAlEVrmSHD 
LDIF 


6206 


10 


1442 


I rSERRERSCLHLVC IRCSCDWEMGSVL6LCSMASWI PCLCGS 
APCIiCRCCPSGNNSTVTRIiIYALFLLVGVCVACVMLIPGMEEQ 
LNKI PG FCENEKGWPCNI LVG YKAVYRLCFGLAMPYLLLSLLM 
IIWKSSSDPRAAVHNGFWFFKFAAAIAIIIGAFFIPEGTFT'nn^ 
FYVGMAGAFCFILIQLVLIilDFAHSWNESWVBKMEEGNSRCWYA 
ALLSATALNYLLSLVAIVLPFVYYTHPASCSENKAFrsVNMLLC 
VGASVMSILPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPB 
TNCNPSLLSIIGYNTTSTVPKEGQSVQWWHAQGIIGLILFLLCV 
FYSSIRTSNNSQVNKLTLTSDESTLIBDGGARSDGSIjEDGDDVH 
RAVDNERDGVTYS YS FFHFMLFLASL Yl MMTT>tnwyr vPDQmrM 

KSQWTAVWVKISSSWIGIVLYVWTLVAPLVLTNRDFD 


6207 


2924 


1471 


TVMAEAATPGTTATTSGAGAAAATAAAASPTPIPTVTAPSLGAG 
GGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSPY 
SWCKYFQRGyClYGBRCRyEHSKPLKQEEArATELTTKSSLAA 
SS SLSS 1 VQPLVEMNTGEAES RNSNFATVGAGSEDWVNAIEFVP 
GQPYCXSRTAPSCTEAPLQGSVTKESSEKEQTAVETKKQIiCPYAA 
VGECRYGENCVYLHGDSCDMCGLQVnHPMDAAQRSQHIKSCIEA 
HEKDMELS FAVQRSKDMVCX3 1 CMEWYEKANPSERRFGI LSNCW 
HTYCLKCIRKWRSAKQFESKIIKSCPECRITSNFVlPSEYWVEE 
KEEKQKLILKYKEAWSNKACRYFDEGRGSCPFGGNCFYKHAYPD 
GRRESPQRQKVGfTS SRYRAORliNW?WPT, TPPP PUJCMD-cnxTnT?!--!:- 

VVCP2IjGEMLLMLLAAGGDDELTDSEDEWDLPHDEI,EDFYDLDL 


620B 


2924 


147a 


TVMASAATPGTTATTSGAGAAAATAAAAS PTPI PTVTAPSLGAG 
GGGGGSDGSGGGWTKQVTCRYFMHGVCKEGDNCRYSHDLSDSPY 

swckyforgyciygdrcryehskplkqeeatatelttksslaa 
ss slss rvgplvemntgeaes rnsnfatvgagsedwvnaiefvp 
gqpycgrtapscteaplqgsvtkeesekeqtavetkkqlcpyaa 
vgecrygewcvylhgdscdmcglqvlhpmdaaqrsqhiksciea 

KEKDMELSPAVQRSKDMVCGICMEWVElCfiWPfiPR]?T7r' TT QMrw 

htyclkcirkwrsakqfeskiikscpecritsnfvipseywvee 
keekqklilkykeamsnkacryfdegrgscppggncfykhaypd 
grrbepqrqkvgtssryraqrrnhfwelieerensnpfdndeee 
wtfelgemllfdliiaaggddeltdsedewdlfhdeledfydldl 


6209 


1758 


829 


ERLCFPCMQSKIYSYMSPNKCSGMRPPLQEENSVTHHEVKCQGK 

plagiyrkreekrnagnavrsakkseeqkikdarkgplvpfpnq 

KSEAAEPPKTPPSSCDSTNAAIAKQAIiKKPIKGKQAPRKKAQGK 

tqqnrkltdfypvrrssrkskaelqsberkridelibsgkeegm 
kidlidgkgrgvxatkqfsrgdpweyhgdlieitdakkrealy 

AQDPSTGCYMYYFQYIiSKTYCVDATRETNRLGRLINHSKCGNCQ 

tklhdidgvphliliasrdiaageeli^ydygdrskasieahpwl 

KH 


6210 


3751 


387 


ifgmsklrmvlledsgsadfrrhfvnlspftitwlllsacfvt ~ 

sslggtdkelrlvdqenkcsgrvevkvqeewgtvcnngwsmbav 

svicnqlgcptaikapgwanssagsgriwmdhvscrgnesalwd 

CKHDGWGKHSNCTHQQDAGVTCSDGSNLEMRLTRGGNMCSGRIE 
IKFQGRWGrvCDDNFNIDHASVICRQLECGSAVSFSGSSNFGEG 
SGPIWPDDLICNGNESALWNCKHCGWGKHNTCDHAEDAGVICSKG 
ADLSLRLVDGVTECSGRLBVRFQGEWGTICDDGWDSYDAAVACK 
QLGCPTAVTAIGRVNASKGFGHIWLDSVSCQGHEPAVWQCKHHE 
WGKHYCWHNEDAGVTCSDGSDriELRLRGGGSRCAGTVEVEIQRL 
LGKVCDRGWGLKEADWCRQLGCGSALKTSYQVYSKIQATNTWL 
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SEQ 
10 
NO: 


Predicted 

beginning 

nucleotide 

location 

c or re spond ing 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end" 

nucleotide 

location 

c or re spending 

to first 

amino acid 

residue of 

amino acid 

sequence 


Amino acid se^menc containing signal peptide" 
(A=Alanine, C«Cysheine, D^Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, lalsoleucine, K?=Lysine, 
L=Leucine, M=Methionine, Ns^Asparagine, 
P=ProliTie, Q=Glubamine, R=Arginine, 
S«Serine, T=Threonine, Va Valine, 
W=Tryptpphan, Y^^Tyrosine, X=^Un3cnown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=po3sible nucleotide insertion) 








FIiSSCNGNETSLWDCICNWQWGGLTCDHYKEAKITCSAHREP'RLV 
GGDIPCSGRVEVKHGDTWGS I CDSDFSLEAAS VLCRELQCGTW 

SILGGAHP6EGNGQIWAEEFQCEGHESHLSLCPVAPRPEGTCSH 
S RD VG WCS RYTE I RLVNGKT PCEGRVELVTT jnjx wr* c t r-NrouMn 

lEDAHVLCQQLKCGVALSTPGGARFGKGNGOIWRHMPHCTGTEQ 
HMGDCPVTALGASLCPSEQVASVICSGNQSQTLSSCNSSSUSPT 
RPTIPEESAVACIESGQLRLVNGGGRCAGRVEIYHEGSWGTICD 
DSWDLSDAHWCRQLGCGEAINATGSAHFGEGTGPIWLDEMKCN 
GKESRIWQCHSHGWGQQNCRHKEDAGVICSEFMSLRLTSEASRE 
ACAGRLEVFYNGAWGTVGKSSMSETTVGWCRQLGCADKGKINP 
ASLDKAMS I PMWVDNVQCPKGPDTLWQCPSS PWBKRLASPSEET 
WITCDNKIRLQEGPTSCSGRVEIWHGGSWGTVCDDSWDLDDAQV 
VCQQLGCGPALKAFKEAEFGQGTGPrWLNEVKCKGNESSLWDCp 
ARRWGHSECGHKEDAAVNCTDISVQicrPQKATTGRSSRQSSFIA 
VGILGWLIAIFVALPFLTKKRRQRQRLAVSSRGENLVHQIQYR 
E WNS CLNADDLDLMNS S GGHSE PH 


6211 

- 


3761 


387 


IFGMSKLRMVLLEDSGSADFRRHFVWLSPFTltTWLLLSACFVT ' 

SSLGGTDKEIiRLVDGENKCSGRVEVKVQEEWGTVCNNGWSMEAV 

SVXCNQLGCPTAIKAPGWANSSAGSGRIWNDIIV3CRGNESALWD 

CKHDGMGKHSNCTHQQDAGVTCSDGSNLEMRLTRGGNMCSGRIE 

IKPQGRWGTVCDDNFNIDHASVICRQLECGSAVSFSGSSMFGEG 

SGPIWPDDLIOfGNESALWNCKHQGWGKHNCDHAEDAGVICSKG 

ADLSLRLVDGVrECSGRLEVRFQGEWGTICDDGWDSynAAVACK 

QLGCPTAVTAIGRVNASKGFGHIWLDSVSCQGHEPAVWQCKHHE 

WGKHYCNHNEDAGVTCSDGSDLELRLRGGGSRCAGTVEVEIQHL 

LGKVCDRGWGLKEADWCRQUSCGSALKTSYQVYSKIQATKTWL 

FLSSCNGNETSIiWDCKNWQWGGLTCDHYEEAKITCSAHREPRLV 

GGD I PCSGRVEVKHGDT WGS ICDSDFSIiEAAS VLCREIiQCGT W 

SILGGAHPQEQNGQIWAEEFQCEGHESHLSLCPVAPRPEGTCSH 
SRDVGVVCSRYTEIRrjVNf5TirTPPT7(^D'TrwT vpt r>nMf^*>r m-r^T-n-T^ 

lEDAHVJCCQQLKCGVALSTPGGARFGKGNGQIWRHMFHCTGTEQ 
HMGDCPVTALGASLCPSEQVASVICSGNQSOTLSSCNSSSLGPT 
RPTIPEESAVACIESGQLRLVNGGGRCAGRVBIYHEGSWGTICD 
DSWDLSDAHWCRQWCGBAINATGSAHFGEGTGPIWLDEMKCN 
GKESRlWQCHSHGWGQQKCRHKEDAGVICSEFMSLRIiTSEASRB 
ACAGRLEVFYKGAWGTVGKSSMSETTVGWCRQLGCADKGKIWP 
ASLDKAMSIPMWVDKVQCPKGPDTLWQCPSSPWEICRLASPSEST 
WITCDNKIRLQEGPTSCSGRVEIWHGGSWGTVCDDSWDtiDDAQV 
VCQQLGCGPALKAFKEAEFGQGTaP I WLWEVKCKGWESSLWDCP 
ARRWGHSECGHKEDlAAVKCTDISVQKTPQKATTGRSSRQSSFIA 
VGILGWLLAIFVALPFLTKKRRQRQRLAVSSRGEWLVHQIQYR 
BMNSCIiNADDLDLMNSSGGHSEPH 


6212 


1 


1134 


LKWELRPGGAVWGTGRGAGTGAPRSCCCQTNPGPPSSLRRAFRR 
RELPFPACHEIGLGAEAGSGPPPAPAARESRSRAMEEEASSPGL 
GCSKPHLBKLTLGITRILBSSPGVTEVTIIEKPPAERHMISSWE 
QKNNCVMPEDVKNFYLMTKGFHMTWSVKLDEHIIPLGSMAINsr 
SKLTQLTQSSMYSLPNAPTLADLEDDTHEASDDQPEKPHFDSRS 
VIPELDSCNGSGKVCLVYKSGKPALAEDTEIWFLDRALYWHFLT 
DTFTAYYRLLITHLGLPQWQYAFTSYGISPQAKQRVSMYKPITY 
NTNLLTEETDSFVNKLDPSKVFKSKNKIVIPKKKGPVQPAGGQK 
G9$GV3GPSTSSTSKS$SGSGNPTRK 


6213 


1 


1134 


liKWELRPGGAVWGTGRGAGTGAPRSCCCQTNPGPPSSLRRi^FRR 
RELPFPACHE IGLGAEAGSGPP PAPAARESRSRAMEEEASSPGL 
SCSKPHLEKLTLGITRILESSPGVTEVTIIEKPPAERHMISSWE 
5KNNCVMPEDVKNFYLMTNGFHMTWSVKLDEHIIPLGSMAINSI 
SKLTQLTQSSMYSLPWAPTLADLEDDTHEASDDQPBKPHPDSRS 
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SEQ 
ID 
NO: 


Predicted ' 
begiziningr 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F^Phenyl alanine, G^Glycine, 
H»Hi3tidine, I-Isoleucine, K=Lysine, 
Ls^Leucine, M=Methxonine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S:=Serine, TcThreonine, Valine, 
Wi=Tryptophan, YsTyrosine, X^Unknown, *=Stop 
Codon, /sipossible nucleotide deletion, 
\=possible nucleotide insertion) 








VIFELDSCNGSGKVCLVYKSaKPALAEDTEIWPLDRALYWHFLT 
DTFTAYYRLLITHIiGLPQWQYAPTSYGISPQAKQRVSMYKPITY 
NTNLLTEETDSFVNKLDPSKVFKSKNKIVIPKKKGPVQPAGC3QK 
GPSGPSGPSTSSTSKSSSGSGNPTRK 


6214 


2 


450 


HEIAPSAIRRAARLCLGPARMQSRAAAt^FVRGFRTGWSFVGWV" 
VLGTSAKRTRLFFFLSKMAASSRAQVIALVRAMLRESKRFSAYN 
YRTYAVRRIRDAFRENKNVKDPVEIQTLVNKAKKDLGVIRRQVH 
IGC2IiYSTDKLI I ENRDMPRT 


6215 


2 


1849 


FVAGG PRGSGSA^^TMPE IRVTPLGAGQDVGRS C ILVS lAGKNV 
MLDCGWHMGFNDDRRFPDFSYI'l-QNGRr.TDFLDCVIISHFHLnH 
CGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDXKGE 
ANPFTSQMIKDCMKKWAVHLHQTVQVDDELEIKAYYAGHVLGA 
AMFQIKVGSESWYTGDYNMTPDRHLGAAWIDKCRPNLLITEST 
YATTIRDSKRCRERDFIiKKVHETVERGGKVLIPVFAIiGRAQELC 
ILLETFWERhlNLKVPIYFSTGLTEKANHYYKLFIPWTTNfQKIRIcr 
FVQENMPEFKHIKAFDRAFADNPGPMWFATPGMLHAGQSLQIF 
RKWAGNEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVK 
MOVE YMS FSAHADAKGIMQLVGQAE PES VLLVHGEAKKME PLKQ 
KIEQELRVNCYMPANGETVrLPTSPSIPVGISLGLLKREMAQGL 
LPEAKKPRLLHGTLIMKDSNFRLVSSEQAUKELGLAEHQLRFTC 
RVHLHDTRKBQETAI^VYSHLKSVLKDHCVQHLPDGSVTVESVL 
LQAAAPSEDPGTKVLLVSWTYQDEELGSFlaTSLLKKGLPQAPS 


6216 


11 


393 


QTTRPEPRNSALRQSRSKMAWGVSSVSRLIiGRSRPQLGRPMSS 
GTUIGEEGSARMWKTLTFPVALPGVAVSMIimnfLKSHHGEHERPE 
FIAYPHLRXRTKPFPWGDGNHTLFHNPHVNPIiPTGYEDE 


6217 


9 


1178 


TRVGRGESGLKMEVKPPPGRPQPDSGRRRRRRGEEGHDPKEPEQ 
LRKLF rGGLSPETTDDSLREHPEKWGTLTDCWMRDPQTKRSRG 
FGFVTYSCVBBVDAAMCARPHKVDGRWBPKRAVSREDSVKPGA 
HLTVKKIFVGGIKEDTEEYNLRDYFEKYGKIETIEVMEDRQSGK 
KRGFAFVTFDDHDTVDKIWQKYHTINGHNCEVKKALSKQEMQS 
AGSQRGRGGGSGNFMGRGGNFGGGGGNFGRGGWFGGRGGYGGGG 

ggsrgsygggdggyngfggdggnygggpgyssrggyggggpgyg 
wqgggygggggydgyneggnfgggnyggggnyndfgnysgqqqs 
ktygpmxggsfggrsSgspygggygsgggsggygsrrf 




130S 


906 


SCERRGFIMADDLKRFLYKKLPSVEGLHAIWSDRDGVPVIKVA 
NDNAPEHALRPGFLSTFALATDQGSKIiGLSKNKSI ICYYNTYQV 
VQFWRLPLWSFIASSSANTaLrVSLEKELAPLFEELRQWEVS 


6219 


2 


890 


AGPGEGAGAGTRCAGAEAEMASAGGEDCESPAPEADRPHQRPPL 
IGVSGGTASGKSTVCEKXMELLGQNEVEQRQRKWILSQDRFYK 
VLTAEQKAKALKGQYNPDHPDAPDNDLMHRTLKNIVEGKTVEVP 
TYDFVTHSRLPETTWYPADWLFEGILVFYSQEIRDMFHLRLF 
VBTDSDVRI/SRHVLRDVRRGRDLEQILTQYTrFVKPAFEEPCLP 
TKECYADVr I PRGVDNMVAlNIilVQHIQDILNGDICKWHRQGSWQ 
RSYKRTFSEPGDHPGMLTSGKRSHLESSSRPH 


6220 
62il 


227 


764 


EQNISIiEMSCTIEKALADAKALVERUlDHDDAAESLI EQTTALW"' 

KRAraAMKQYQEEIQKLNEVARHRPRSTLVMGIQQENRQIRELQQ 

ENKELRTSLEEHQSALELIMSKYREQMFRLLMASKKDDPGIIMK 

LKEQHSKIDMVHRWKSEGFFLDASRHILEAPQKGLERRHLEANQ 
NVH 




98 


916 


RWIWDLNPVSDGLELRPKYNGILHCIiTTIWKLDGLRGLYQGVTP 
NIWGAGIiSWGLYFVFYNAIKSYKrEGRAERLEATEYLVSAAEAG 
AMTLCITNPLWVTKTRLMLQYDAWNSPHRQYKGMPDTLVKIYK 
YEGVRGLYKGFVPGLFGTSHGALQFMAYELLKLECYKQHINRLPE 
AQLSTVEYISVAALSKIFAVAATYPYQWRARLQDQHMFYSGVI 
DVITKTWRKEGVGGFYKGIAPNLIRVTPACCITFWYENVSHFB 
LDIiRBKRK 
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PCT/USOO/34263 



SEQ 
ID 
NO: 

6222 


Predicted 
becfinning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Jmrno acxa segment containing signal peptT3i 
(A-Alanine, C=Cysteine, D^Asparfcic Acid, B» 
Glutamic Acid, P^Phenylalanine, G-Glycine 
H«Hi3tidine, X==Isoleucine, K=i,yeine 
L^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
SsSerlne, T=Threonine, V^rValine, 
W=Tryptophan, y=Tyrosine, X=UnJcnown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 




6223 


2 


2115 


MARELRALi,LWGRRLRPLLRAPALAAVPGGKPILCPR5TTAQI.G 

PRRNPAWSLQAGRLFSTQTAEDKEEPLHSriSSTESVQGSTSKH 

EPQAETKKLLDIVARSLYSEKEVFIRELISNASDALEKLRHKLV 

SDGQALPEMEIHLQTNAEKGTITIQDTGIGMTQEELVSNLGTIA 

RSGSKAPLDALQNQAEASSKIIGQFGVGPYSAFMVADRVEVYSR 

SAAPGSLGYQWLSDGSGVFEIABASGVRTGTKIIIHLKSDCKEF 

SSEARVRDWTKYSNFVSPPLYLNGRRMNTLQAIWMMDPKDVRE 

WQHBEPYRYVAQAHDKPRYTLHyKTDAPLMrRSIFyvPDMKPSM 

FDVSRELGSSVALYSRKVLIQTKATDILPKWLRFIRGWDSEOI 

i'XiWLitoKiijjL.QSSAIiiRKIjRDVljQQRLIKPFIDQSKKnAEKYAKF 

FEDYGLFMR^GIVTATEQEVKEDIAKLLRVESSALPSGOLTSLS 

SYASRMPJWSTRNIYYLCAPNRHLAEHSPYYEAMKKKDTEVLFCF 

EQFDELTLLHLREFDKKKLISVETDIWDHYKEBKFEDRSPAAE 

CLSEKETEELMAWMRNVLGSRVTKTVKVTLRLDTHPAMVTVLBMG 

AARHPLRMQQLAKTQEERAQLLQPTLEINPRHALIKKLNQLRAS 

EPGLAQLLVDQIYENAMIAAGLVDPPRaMVGRIJJEU.VKAT.RPW 




6224 


3 
1 


715 
133 


DAWARTMAGMVDFQDEEOVKSFLENMEVECNIiHCXHKKDPDGCY 

RLVDYLEGIRKNFDEAAKVLKFKCEENQHSDSCYKLGAYYVTGK 

GGLrODLKAAARCFLMACBKPGKKSIAACHNVGLLAHDGQVNBD 

GQPDLGKARDYYTRACDGGYTSSCFNLSAMFLC2GAPGFPKDMDL 

ACKYSMKACDLGHIWACANASRMYKLGDGVDKVEAKAEVLKNRA 
QQVHKEQQKGVQPLTFG 




622S 
6226 


3259 


938 


XiKTXSSMAWGPLLLTLLAHCTGSWAQSVLTOPPSVSGARIPHEK ' 

liLSCHRLAHjKLPFSVESRKTVMGPQGARRQAFLAPGDVTVDFT 

OKEWRLLSPAQRALYREVTLENYSHLVSLGILHSKPBLIRRLEQ 

GEVPWGEERRRRPGPCAGIYAEHVLRPKNLGLAHQRQQQLQPSD 

QSFQSDTAEGQEKEKSTKPMAFSSPPLRHAVSSRRRNSWEIES 

SQGQRENPTEIDKVLKGIENSRWGAFKCAERGQDFSRKMMVIIH 

KKAHSRQKLFTCRECHQQFRDESALLLHQNTH'TGBKSYVCSVCG 

RGFSLKANLLRHQRTHSGEKPFLCKVCGRGYTSKSYLTVHERTH 

TGEKPYECQECGRRFNDKSSYNKHL-CAHSGEKPFVCKECGRGYT 

NKSYFWHKRIHSGEKPYRCQECGRGPSNKSHLITHQRTHSGEK 

PFACRQCKQSFSVKGSLLRHQRTHSGEKPFVCKDCERSFSQKST 

LVYHQRTHSGEKPFVCRECGQGPIQKSrLVKHQITHSEEKt^PVC 

KDCGRGFIQKSTFTLHQRTHSEEKPYGCRECGRRFRDKSSYNKH 

LRAHLGEKRFFCRDCGRGPTLKPNLTIHQRTHSGEKPFMGKQCE 

KSPSLKANLLRHQWTHSCERPFNCKDCGRGFILKSTLLFHQKTH 

SGBKPPlCSBCGQQFIWKSlfLVKHQLAHSGKQPFVCKECGRGFW 

WKGKLLTHQRTHSGEKPFVCNVCGQGFSWKRSLTRHHWRIHSKE 

KPFVCQECKRGYTSKSDLTVHERIHTGERPYECQECGRKFSNKS 

YYSKHLKRHLREKRFCTGSVGEASS 


^27 


29 


266 


Tls.V£iJ!:LLGGSgRLFFLPLWRRLCRCGLGPRVSPMAGPRVEVDGS 
IMEGGGQSLRVSTGLSWLLSLPWRAQRIRAGRSYA 




2581 


890 

• 

1 
( 
} 
I 
f 
\ 

1 


u Q * **-vj \^LH A V y vti\^i^u\jtuauuDcKi:'xLji5 PQAR P 
^JNAYTAMSDSYLPSyYSPSIGFSYSLGEAAWSTGGDTAMPvLTS 
YGQLSNGEPKFLPDAMFGQPGALGSTPFLGQHGFNl^fPSGIDPS 
^MGNWSSQGQSTQSSGYSSNYAYAPSSLGGAMIDGQSAFANETL 
^JKAPGMNTIDQGDOAALKLGSTEVASNVPKVVGSAVGSGSITSNI 
/ASNSLPPATIAPPKPASWADXASKPAKQQPKLKTKNGIAGSSL 
^PPPIKHfJMDIGTWDNKGPVAKAPSQALVQNIGQPrQGSPQPVG 
3QANNSPPVAQASVGQQTQPLPPPPPQPAQLSVQQQAAQPTRWV 
^PRNRGSGFGHNGVDGNGVGQSQAGSGSTPSEPHPVLEKLRSIKT 
lYNPKDFDWNLKHGRVFIIKSYSEDDIHRSIKYHIWCSTEHGWK 
ILDAAYRSMNGKGPVYLLFSVNGSGHFCGVAEMKSAVDYNTCAG 
rWSQDKWKGRPDVRWXFVKDVPNSQLRHIRLEKNENKPVTNSRD 
'QEVPLEKAKQVLKIIASYKHTTSIPDDFSHYEKRQ 
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SEQ 
ID 
WO; 


Predicced 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
CA=Alanine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K= Lysine, 
L=i.eucine, M-Methionine, N=Asparagine, 
P-Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, Vt=Valine, 
W=rryptophan, Y^Tyrosine, X«Unknown, *=Stop 
Codon, /«possible nucleotide deletion, 
\=possible nucleotide insertion) 


£228 
6229 


47 


1978 


GRRCRRRGAVMELAQEARELGCWAVEEMGVPVAARAPESXrRRIj 
CLGQGADIWAYILQHVHSQRTVKKIRGNLLWYGHQDSPQVRRKL 
ELEAAVTRLRAEIQELDQSLELMERDTEAODTaMPn at? nuTonT 

QRRALLLRAQAGAMRRQQHTLRDPMQRLQNQLRRLQ0MERKAKV 
DVTPGSLTSAALOLEPWLRDVRTACTLRAQPLQNLLI,PQAKRG 
SLPTPHDDHFGTSYQQWLSSVETLLTNHPPGHVLAALEHLAAER 
EAEIRSLCSGDGLGDTBISRPQAPDQSDSSQTLPSMVHLIQEGW 
RTVGVIiVSQRSTLLKERQVLTQRLOGLVEEVERRVLGSSERQVL 
ILGLRRCCLWTELKAIiHDQSQSLQDAAGHRQLLLRELQAKQQRI 
LHWRQLVEBTQEQVRLLIKGNSASKTRLCRSPGEVIALVQRKW 
PTFEAVAPQSRKLLRCLEEEVRHLPHILLGTLLRHRPGELKPLP 
TVLPSIHQLHPASPRGSSFIALSHKLGLPPGKASELLLPAAASL 
RQDLLLLQDQRSLWCWDLLHMKTSLPPGLPTQELLQIQASQBKQ 
QKENLGQALKRLBKLLKQALERIPELQGIVGDWWEQPGQAALSE 
ELCQGLSLPQWRLRWVQAQGALQKLCS 


6230 


1571 


560 


GPSLIiGTRGTPNPARTLQIFFLlIGRRbTGRMAAVDDLQFEEfG^ 
NAATSLTANPDATTVNIEDPGETPKHQPGSPRGSGREEDDELLG 
NDDSDKTELLAGQKKSS PFWTFE YYQTFFDVDT YQVPDR I KGSL 

LPIPGKNPVRLYlRSHPDLYGPPWICATLVFArAXSGNLSNPLI 
HLGEKTYHWPEFRKVSIAATIIYAVAftrr.vPTAT.wnwT mudxtov 

VMNIVSYSPLEIVCVYGYSLFIYIPTAILWIIPHKAVRWILVMI 
ALGISGSLIAMTPWPAVRBDNRRVAIiATIVTIVLLHIMLLSVGCL 
AYFFDAPEMDHLPTTTATPNQTVAAAKSS 




1723 


600 


skmsgrsgkkkmsklsrsaragvippvgrlmryLkkgtpkyris 

VGAPVYMAAVIEYLAAEILEIAGKAARDNKKARIAPRHILLAVA 
*-i-j-u.i(srfijjjrviji V X i/i&vjrfj V jjt K Xlli'jbtJjIjAKKRGTK^ 

ppekrgrkatsgkkggkkskaakprtskkskpkdsdkegtsnst 
sedgpgdgptilsskslvlgqklsltqsdishigsmrvegivhp 
ttaeidlkbdigkalekaggkefletvkelrksqgplevaeaav 

SQSSGLAAKFVIHCHIPQWGSDKCEEQLEETIKNCLSAAEDKKIi 
KSVAFPPFPSGRNCFPKQTAAQVTIiKAISAHPDDSSASSLKNVY 
FLLFDSES IGI YVQEMAiCLDAK 


6231 
6232 


149 


870 


l.l*'asSTMDiiSJ.RNVLWSFGFl,LI>FTAYGGLQSLQSSLYSEEG 

lgvtalstlyggmllssmflppllierlgckgtiii^mcgyvaf 

SVGNFPASWYTLrPTSILIiGLGAAPLWSAQCTyi.TlTGNTHAEK 

agkrgkdmvnqyfgifflifqssgvwgnlisslvfgqtpsqetl 

PEEOLTSCaASDCLMATTTTNSTQRPSQQLVYTLIiGIYTGSGVL 
AVLMIAAFLQP IRDVQRESE 


6233 


3679 
1 


1476 
2654 ] 


IrVAGTTMAGFWVGTAPLVAAGRRGRWPPQQtMLSAALRTLKHVL 

YYSRQCLMVSRNLGSVGYDPNEKTFDKILVANRGEIACRVIRTC 

KKMGIKTVAIHSDVDASSVHVKMADEAVCVGPAPTSKSYLNMDA 

IMEAIKKTRAQAVHPGYGPLSENKEFARCriAAEDVVFIGPDTHA 

rOAMGDKIBSKLIiAKKAEVNTlPGFDGWKDAEEAVRIAREIGY 

PVMIKASAGGGGKGMRIAWDDEETRDGFRLSSQEAASSFGDDRI. 

111 ajt* XDNPRHIEIQVLGDKHGNALWLNERECS I QRRNQKWEE 

APSIFLDAETRRAMGEQAVALARAVKYSSAGTVEFLVDSKKNFY 

FLEMNTRLQVEHPVTECITGLDLVQEMIRVAKGYPLRHKQADIR 

rNGWAVECRVyAEDPYKSFGLPSIGRLSQYQEPLHLPGVRVDSG 

IQPGSDISlYYDPMISKLITYGSDRTEALKRMADAIiDNYVIRGV 

THNIALLREVIINSRFVKGDISTKFLSDVYPDGFKGHMLTKSEK 

NQLLAIASSLFVAFQIJRAQHFQENSRMPVIKPDIANWELSVKLH 

DK^/HTWASl^GSVFSVEVDGSKLNVTSTWNIASPLLSVSVDGT 

aRTVQCLSREAGGNMSIQFLGTVYKVNILTRLAAELNKPMLEKV 

rEDTSSVLRSPMPGVWAVSVKPGnAVAEGQEICVIEAMKMQWS 

VITAGKTGTVKSVHCQAGDTVGEGDLLVELE 

^STRENIJffAGiNrJ^^FPSEGHLVRSTGPGGSPAKH^fVAQCVSPKGP 



470 



wo 01/53312 



PCTyUSOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, P=Phenylaaanine, G^^Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M^Methionine, N^Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W;=Tryptophan, Y=Tyrosine, X^Unknown, *^StQp 
Codon, /^possible nxicleotide deletion, 
\=possible nucleotide insertion) 








LACSRTYFFGATHVPYLGGDSKLPKKTEQIRLLSQIYAAVIEAV 
LAGIACYAKTSSLTKAKEVAKQTLGSGLDSFEIilPFKAALRSKM 
TFHIHAVNNQGRIVPLDSEDSLSFVKTACMAVYDIPDLLGGNGC 
U3SWFSESFLTSQILVKEKDGTVTTETSSVVLTAAVPRFCSWL 
VEDNEVKI*SEKXHQAVRGDESPI*GTYLTGGEGAYLYSSNLQSWP 
EEGNVHFFSSGLLFSHCRHGSIIISKDHMNSISFYDGDSTSTVA 
ALLIDPKSSLLPHLPVHFHGS SNFLMI AIiFPKSKI YQAFYS EVF 
SLMKQQDNSGISLKVIQEDGLSVEQKRLHSSAQKLFSALSQPAG 
EKRSSLKIiLSAKLPELDWFLQHFAISSISQEPVMRTHLPVLLQQ 
AErNTTHRIESDKVIISIVTGLPGCHASELCAFLVTLHKECGRW 
MVYRQIMDSSECFHAAHFQRYLSSALEAQQNRSARQSAYIRKKT 
RLLWLQGYTDVIDWQALQTHPDSNVKASFTIGAITACVEPMS 
CYMEHRFLFPKCLDQCSQGLVSNWPTSHTTEQRHPLLVQLQSL 
IRAANPAAAFILAENGIVTRNEDIELILSENSFSSPEMLRSRYL 
MYPGWYEGKLNAGSVYPLMVgrCVWFGRPLEKTRPVAKCKAIQS 
SXKPSPFSGNIYHILGKVKFSDSERTMEVOrNTLANSLSIMPVL 
EGPTPPPDSKSVSQDSSGQQECYLVFIGCSLKEDSIKDWLJJQSA 
KQKPQRKALKTRGMLTQQEIRSIHVKRHIjEPLPAGYFYNGTQPV 

NFFGDKTDPHPLMDQFMBTOYVEEANREIEKYKfQELEQQEYHDLF 
ELKP 


6234 


1731 


404 


PRVREDMDHKSPGNKGSLVYAGIKSIVKSSLGMVESSRHNWSGL 
DKQSDIQNLNEERILAIiQLCGWXKKGTDVDVGPFLNSLVQEGEW 
ERAAAVALFMLDIRRAIQIIiNEGASSEKGDliJLNVVAMAIjSGYT 
DEiOTSLWREMCSTLRLOLNNPyLCViyiFAFLTSETGSYDGVLYEN 
KVAVRDRVAFACKFIiSCrrOLNRYIEKLTNEMKEAGNLEGILLTG 
LTKDGVDLMES YVDRTGDVQTAS Y CMT.QGSPLDVLKDERVQYW I 
ENYRNLLDAITOFWHKRAEFDIHRSKLDPSSKPLAQVFVSCNFCG 
KSISYSCSAVPHQGRGPSQYGVSGSPTKSKVTSCPGCRKPLPRC 
ALCLINMGTPVSSCPGGTKSDEKVDLSKDKKLAQFNNWPTWCHN 
CRHGGHaGHMLSWFRDHABCPVSACTCKCMQLDTTGNLVPAETV 
QP 


5235 


I 


571 


EKRDHRLPSWPRAALKVPGRGGRVGTTPEIiAAGGlMATRNPPPQ 
DYESDDDSYEVLDLTEYARRHQWWNRVFGHSSGPMVEKYSVATQ 
IVMGGVTGWCAGFLFQICVGKLAATAVGGGPLLLQIASHSGYVQI 
D>7KRVEKDVNKAKRQIKKRANKAAPEINNLIEEATEFIKQNIVI 
SSGFVGGFLLGLAS 


6236 


1 


703 


WDQNKGAAAQSGLTLPSLPSARFSAGPPTQRSRPTMSNMEKHIjF 
NLKFAAKELSRSAKKCDKEEKAEKAKIKKAIQKGNMEVARrHAE 
WAIRQKNQAWFLRMSARVDAVAARVQTAVTMGKVTKSMAGWK 
SMDATLKTMNLEKISAbMDKFEHQFETLDVQTQQMEDTMSSTTT 
LTTPQETQVDMLLQEMADEAGLDLNMELPQGQTGSVGTSVASAEQ 
DELSQRIARLRDQV 


6237 


312 


720 


PTAMAEEGIAAGGVMDVNTALQEVLKTALIHDGIoARGIREAAKA" 
LDKRQAHbCVIiASNCDEPMYVKLVEAIjCABHQINLIKVDDWiaai 
GEWGLCKIDREGKPRKWGCSCVWKDYGKESQAKDVIEEYFK 
CKK 


6238 


2 


4666 


ebvptqesvkweinviikmpeivfvadmtkwdapalvittqceT" 

CYKGNLENSTMTAAIKDIiQVRACPFLPVKRKGKITTVLQPCDLF 
YQTTOKGTDPQVIDMSVKSLTLKVSPVIINTMITITSALYTTKE 
TIPBETASSTAHLWEKKDTKTLKMWFLEESNETEKIAPTTELVP 
KGEM I KMNIDS I FI VLBAG IGHRTVPMLLAKSR FSGEGKNWSS L 
INLHCQLELEVHYYNEMFGVWEPLLEPLBIDQTEDFRPWWLGIK 
MKKKAKMAIVESDPEEENYKVPEYKTVISFHSKDQLNITLSKCG 
LVMLNNr*VKAFTSAATGSSADFVKDLAPFKIIiNSLGLTISVSPS 
DSPSVLNIPMAKSYVLKNGESLSMDYIRTKDNDHFNAWTSLSSK 
LFFILLTPVNHSTADKIPLTKVGRRLYTVRHRESGVERSIVCQI 



471 



wo 01/533t2 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicte<k end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
secpi&nce 


Amino acid segment containing signal peptide 
(A-Alonine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P= Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leuclne, M=Methionine, N=Asparagine, 
P«Proline. Q=Glutamine, R=^rginine, 
S=Serine, T=:Threonine , V=Valine, 
W=Tryptophan, Y«Tyrosine, X«Unknown, *=.Stop 
Codon, /=possil>le nucleotide deletion, 
\=possible nucleotide insertion) 










DTVEGSKKVTIRSPVQIRNHFSVPLSVYEGDTLLGTASPBNEFW 
I PLGS YRSFI FLKPEDBNYQMCEGI DFEE 1 1 KNDGALL KKKCRS 
KNPS KES FLINI VPEKDNLTSLS VYSEDGWDLP YIMHLWPPILL 
RNLLPYKIAYYIEGIENSVFTLSEGHSAQICTAQLGKARLEILKL 
LDYLMHD WKSE YH I KPNQQDI S FVS FTCVTEMEKTDLD lAVHMT 
YNTGQTWAPHSP YWMVWKTGRMLQ YKADGI HRKHP PK YKKPVL 
FSFQPNHFPNNNKVQLMVTDSELSWQFS 1 DTVGSHGAVKCKGLK 
MDYQVGVTIDLSSFNITRIVTFTPFYMIKNKSKYHISVAEEGND 
KWLSLDLEQCI PFWPEYAS SKLLI QVERSEDPPKRI YFNKQENC 
ILLRLDKELGGXIAEVKLAEHSTVITFLDYHDGAATPLLINHTK 
NELVQYNQSSLSEIEDSLPPGKAVFYTWADPVGSRRLKWRCRKS 
HGEVTQKDDMMMPIDLGEKTIYLVSFFEGLQRIILFTEDPRVFK 
VTYESSKAEliAEQEIAVALQDVGISLVNKYTKQEVAYIGITSSD 
WWETKPKKKARWKPMSVKHTEKIiEREFKBYTESSPSEDKVIQL 
DTJIVPVRLTPTGHNMKILQPHVIALRRNYLPALKVEYNTSAHQS 
SFRIQIYRIQXQNQIHGAVFPFVFYPVKPPKSVTMDSAPKPFTD 
VS I VMRSAGHSQISR I KYFFCVLIQEMDLRLDLGFI YALTDLMTE 
AEVTENTEVELFHKDIEAFKEEYKTASLVDQSQVSLYEYFHISP 
IKLHLSVSLSSGREEAKDSKQNGGLIPVHSLNIjLIiKSIGATLTD 

vqdwfklaffeiinyqfhtrsdlqsevirkys kqai kqm yvlil 
gldvlgnppglirefsegveaffyepyqgaiqgpbbfvegmaiig 
lkalvggavgglacaaskitgamakgvaamtmdedyqqkrream 

NKQPAGFREGITRGGKGIiVSGPVSGITGIVTKPIKGAjQKGGAAG 
FFKGVGKGLVGAVARPTGGIIDMASSTFQGIKRATETSEVESLR 
P PRFFNEDGVIRP YRLRDGTGNQMIjQKIQFYRE W I MTHS SSS DD 
DDDDDDDDESDI/>JH 




6239 


2108 


S34 


KPGMAGKGSSGRHPLLLGLLVAVATVHLVICPYTKVEESFWIjQA 
TKDLLYHWQDLEQyDKLEFPGWPRTFLGPWIAVFSSPAVYVL 
SIiLEMS KPYSQLI VRGVLGLGVl FGLWTLQKEVRRHFGAMVATM 
FCWVTAMQPHLM FYCTRTIiPNVLAIiPVVLrJUiAAWLRHEWARFI 
WLSAFAI I VFRV3LCLFLGI*Lt.LIAIiGNRKVSWRAI.RHAVPAG 
ILCLGLTVAVDSYFWRQX)TWPEGKVLVnfNTVIiNKSSNWGTSPIiL 
WYFYSAI^RGLGCSLLFIPLGIiVDRRTHAPTVLALGFKALYSLL 
PHKELRFIIYAFPMLNITAARGCSYLLNNYKKSWLYKAGSLLVI 
GHLWNAAYSATAIiYVSHFNYPGGVAMQRLHQLVPPCyrDVLLHI 
DVAAAQTGVSRFLQVNSAWRYDKREDVQPGTGMLAYTHILMEAA 
PGXiIALYRDTHRVLASWGTTGVSLNIiTQLPPFNVHLQTKIiVLL 
ERLPRPS 


&240 


2202 


1176 


HERGDSLKEPTSIAESSRHPSYRSEPSLEPESFRSPTFGKSFHF 
DPLSSGSRSS SLKSAQGTGP3LGQLQS XRSEGTTSTS YKSLAWQ 
TRNGSLS YDS LLTPSDS PDF3S VQAGPEPDPPLGYTSPFLSARL 
AQQREAERHPRLVPTGPTHR2PSPVRYDNLSRHIVASLQEREKL 
IiRQSPPLPGREEEPGLGDSGIQSTPGSGHAPRTSSSSDDSKRSP 
I£KT PLGRPAVPR FGKPDGLRGRGVGS PEPG PTAP YLGRSMS YS 
SQKAQPGVSETEEVALQPIiLTPKDEVQLKTTYSKSNGQPKSLGS 
ASPGPGQPPLSSPTRGGVKKVSGVGGTTYEISV 


6241 


3 


1341 


RNAEEKKRIiSLQREKI lARVS I DNRTRALVQALRRTTDP KLCI T ' 

RVEBLTFHLLEFPEGKGVAVKERIIPYLLRLRQIKDETLQAAVR 

EILAL IGYVDPVKGRGIRI LS I IXSGGTRGWAIiQTLRKLVELTQ 

KPVHQLFDYICGVSTGAILAFMLGLPHMPLDECEELYRKLGSDV 

FSQNVIVGTVKMSWSHAFYDSQTWENIIiKDRMGSALMIETARNP 

TCPKVAAVSTIVNRGITPKAFVFRNYGHPPGINSHYLGGCQYKM 

WQAIRASSAAPGYFAByALG^^?r^HQDGGIJJLNNPSi^JlA^IHBCKC 

LWPDVPLECIVSLGTGRYESDVRNTVTYTSLKTKrjSNVINSATD 

TEEVHIMLDGLLPPDTYPRPtrPVMCENIPLDESRNEKI*DQLQLE 

GtiKYlERNEQKMKKVAKH,SQEKTTLQKINDWIKI,KTDMYEGI.P 
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SEQ 
ID 
NO: 


Predict eca 
beginning 
nucleotide 
location 
correspotiding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding- 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine, C=cysteine. D=flspartic Acid, E== 
Glutamic Acid, P- Phenyl alanine, G=Glycine, 
H=Histidine, I-Isoleucine, K^Lysine, 
L=Leucine, M^Methionine, N=Asparagine, 
P= Proline, Q==Glut amine, R=Arginlne, 
S^Serine, T=Threonine, V-Vallne, • 
K-iiypLopnan^ x-iyrosme, X-unknown, *=3Stop 
Codon, /^possible nucleotide deletion, 
Xspossible nucleotide insertion) 
PPSKL 




6242 


198 


1310 


QHFLPGAETWSPGAAVCTARRFPGRSIiAAFPRPAAPRRAVEMGE 
SSEDIDQMFSTIiLGEMDLLTQSLGVDTLPPPDPNPPRAEPMYSV 
GPKDriNESLKrAI,EDQDI*DALMADLVADISEAEQRTIQAQKESLQ 
NQHHSASLQAS 1 FSGAASLGYGTNVAATGISQYBDDLPPPPADP 
VLDLPLPPPPPEPLSQEEEEAQAKADKIKLALEKIJCEAKVKKIiV 
VKVHMNDNSTKSLMVDERQLARDVLDNLFBK7HCDCNVDWCr,YE 
lYPELQlERPFEDHENWEVLSDWTRDTENKILPLEKEEKYAVP 

KNPQNPYLDNRGKKESKETNEKMNAXNKESLLEVRLILQSGRKB 
KDVCS I FKS FASENNGKI 




6243 


1509 


614 - 


RSASRFSGCWSRDSTCCCCt>STCWSRSSASCPRARWPPSSAPAT 
TS RASSRRLACGPQTRAGAETRSTAMIRAKSAARDTRRATCRSA 
AGTPSPTTMTCLTDVPTGCAAVEPTARLPAAAWASTITTGCCPA 
MGQAGAGPAGRKGSEAGGGPGRAHHAHPSPLPREPRVRTGPPAH 
SPTPGSlDPSPELSWGSAGVTQESPLLDPVDFLIiFRTRAVDPLR 
RVFFFFYQHLTFFSIQPQPPPCHAFHPRDPPAGTKRQLILVPUC 
GPPILAPILSLTPILSRWSCYFPRSRIAQGWHIiS 




6244 


2119 


1745 


FEHAYASQFGTFLGNNESERCKIiKIiQQKTMSLWSWVNQPSELSK 
FTNPLFEANNLVIWPSVAPQSLPLWEGIFLRWMRSSKYLDEAYE 
EMVNI lEYNKELQAKVNILRRQLAELETEDGMQESP 


6245 


81 


1148 


LSLRNAKYSFFQELISLFSMTDIiNDNICKRYIKMITNIVILSLI 

ICI SloAFWI ISMTASTYYGNLRP 13 P WRWLFSVWP VLX VSNGL 

KKKSLDHSG/VLGGLWGFIIiTIANFSFFTSLLMFFLSSSKLTKW 

KGEVKKRLDSEYKEGGQRNWVQVPCNGAVPTELALLYKIEWGPG 

EIPVDFSKQYSASWMCLSLLAALACSAGDTWASEVGPVLSKSSP 

RLITTWEKVPVGTNGGVTWGLVSSLLGGTFVGIAYFLTQLIFV 

NDLDISAPQWPIIAPGGLAGLLGSIVDSYLGATMQYTGLDESTG 

MWNSPTNKARHIAGKPILDNNAVNLFSSVLIALLLPTAAWGPW 
PRG 


€246 


1177 


359 


SIjWPV7ILMDDSLMQISLQ1.LCVYTANFPNGCSSLCWSSCGQHPV 

qathrgavsnslmlcilkiasqmplentrvqqmvpmli.sklals 
hdckgviqksnflqnflslalpkggnkhlsnltilwlklllnis 
sgedgqqmilrjbdgcldlltemskyichksspllpll i fhnvcps 

PANKPKItiANEKVITVLAACLESENQNAQRiGAAALWALlYlTYQ 

KAKTALKSPSVKRRVDEAySLAKKTFPNSEANPIiHAYYLKCLBW 
LVQLLNSS 


6247 


3 


I u 


nsrvwgpwtepsagslrpmarkqnrnskblglvpiitddtshagp 
pgpgrallecdhlrsgvpggrrrkdwscsllvaslagafgssfL 
ygynls wnaptp yi kafynes werrhgrp idpdtltllwsvtv 
sifaigglvgtlivkmigkvlgrkhtllanngfaisaallmacs 

LQAGAFEMLIVGRFIMGIDGGVALSVLPMYLSEISPKEIRGSLG 
QVrAIFICIGVPTGQLLGLPELLGKESTWpyi.FGVIWPAWQL 
LSLPFLPDS PRYLLLEKHNEARAVKAFQTPLGKAHVSQEVBEVL 
AESRVQRSIRLVSVLELLRAPYVRWQVVTVIVIMACYQIjCGLNA 
IWFYTNSIFGKAGIPPAKlPYVTLSTGGIErLAAVFSGLVlEHli 
GRRPLLIGGFGLMGIiFFGTLTI TLTLQDHAPWVP YLS I VGXLAI 
lASFCSGPGGI PF I LTGEFFQQSQRPAAFI t AGTVWWLSNFAVG 
LLFPFIQKSLDTYCFLVFATICITGAIYLYFVLPETKMRTYASr 
SQAFS KRNKAYPPEEKIDSAVTDGKINGRP 


6248 


56 


1773 


VPPPRMMAAVPPGLE PWNRVR I PKAGNRSAVTVQNPGAAtDLCl""" 

AAVIKECHLVXLSLKSQTLDAETDVLCAVLYSNHNRMGRHKPHL 

ALKQVEQCLKRLKNMNLEGSIQDLPEIiPSSNENQPLTTKVCVVP 

SQPWELVLMKVI^CKLrirtRLLDCCCKTFLLTVKHr,GLQEFlI 

LNLVMVQLVSRLWVLYKGVLKRLILLYEPIiPGLLQEVARIQPMP 

yPKDFTFPSDITEPLGQPYPEAFKKKMPIAFAAKGIirKLUlKLP 

LINEQSPRASEETLLGISKKAKQMKINVQNNVDLGQPVKNKRVF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
locat ion 
corre spond i ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted en^ 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
se9[uence 


Amino acid segment containing signal peptide 
{A«Alanine, C=Cysteine, D=Aspartic Acid, E^c 
Glutamic Acid, F=PhenyIalanine, G=Glycine, 
HssHistidine, l!=Isoleucine, K=Lysine, 
L=Lieucine, M^Methionine, N^Asparagine, 
P=Proline, ' Q=Glutamine, R=Arginine, 
S -Serine, T=Threonine, v=Valine, 
W=Tryptophan, Y^Tyrosine, X^Unknown, *==Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








KBBSSEFDVRAFCNQLKHKATQETSFDFKCSQSRLKTTKYSSQK 
VIGTPHAKSFVQRFREAESFTQLSEEIQMAWWCRSKKLKAQAI 
FLGNKLLKSNRLKHLEAQGTSLPKICLECIKTSICNHLLRGSGIK 
TSKHHIiRQRRSQNKPLRRQRKPQRFCLQSTLLREIQQFSQGTRKS 
ATDTSAKWRLSHCTVHRTDIjyPNSKQLLNSGVSMPVlQTKEKMI 
HENLRGIHENETDSWTVMQINKNSTSGTIKETDDIDDIFALMGV 


6249 


56 


1773 


VPPPRMMAAVPPGLEPWNRVRlPKAGimSAVTVnNPnanr nrn 
AAVIKECHLVILSLKSQTLDAETDVIjCAVLYSNHNRMGRHKPHL 
ALKQVEQCLKRLKNMNLEGSIQDLFELFSSNENQPLTTKVCWP 
SQPWEIjVLMICVLGACKLLLRLLDCCCKTFLLTVKHLGLQEFII 
LNLVMVGLVSRLWVLYKGVLKRLILLYBPLPGLLQEVARIQFMP 
YFKDFTFPSDITEFLGQPYFEAFKKKMPIAFAAKGINKLLNKLF 
LINEQSPRASEETLLGISKKAKQMKINVQNNVDLGQPVKNKRVF 
KEBSSEPDVRAFCNQIiKHKATQETSFDFKCSQSRIiKTTKYSSQK 
VIGTPHAKSFVQRFREAESFTQLSEEIQMAWWCRSKKIjKAQAI 
flgnkllksnrlkkleaqgtslpkkleci kts i cnhllrgsgi k 
TSKHHLRQRRSQNKFLRRQRKPQRKLQSTLLREIQQFSQGTRKS 

atdtsakwrlshctvhrtdlypnskqllnsgvsmpviqtkekmi 
henlrgihenetds wtvmqinknstsgtiketddidd i faiimgv 


6250 


232 


1306 


LAALHIMALPFRKDLEKYKDLDBDELLGNLSETEtiKQLETVLDD 
LDPENALLPAGFRQKNQTSKSTTGPFDREHLIiSYIiEKEALEHKD 
REDYVPYTGEKKGKIFIPKQKPVQTFTEEKVSLDPELEEALTSA 

EKILPVFDEP PNPTNVEESLKRTKEWDAHLVE VKLNNl KH IPIP 

tlkdfakaletnthvkcfslaatrskdpvatafaemlkvnktlk 
slnvesnfitgvgilalidalrdnetlaelkidnqrqqlgtave 
lemakmleentnilkfgyqftqqgprtraanaitknndlvrkrr 

VEGDHQ 


6251 


62 


972 


TPGSGPMSAWAAASIiSRAAARCLIiARGPGVRAAPPRDPRPSHPE 
PRGCGAAPGRTLHFTAAVPAGHNKWSKVRHIKGPKDVERSRIFS 

klclnirlavkeggpnpehmsnlanilevcrskhmpfcstietal 
kmekskdtyllyegrgpggsslliealsnsshkcoadirhilnk 
nggvmavgarhsfdkkgviwevedrekkavnleralemaieag 
aedvketedeeernvpkficdasslhqvrkkldslglcsvscal 

EFI PNSKVQLAEPDLEQAAHLIQAIjSNHEDVIHVYDNIB 


6252 


27 


1897 


eefctwiavrvgbmetapkpgkdvp p kkdklqtkrkkprrywbe "" 

ETVPTTAGASPGPPRNKKNRELRPQRPKNAyiLXKSRISKiCPQV 
PKKPREWKNPESQRGIiSGAQDPFPGPAPVPVEWQKFCRlDKSR 
KLPHSKAKTRSRLEVTiEABEEETSIKAARSELtLAEEPGFLEGB 
DGEDTAKICQADIVEAVDIASAAKHPDLNLRQFGPYRLNYSRTG 
RHLAFGGRRGHVAALDWVTKKLMCEINVMEAVRDIRFLHSEALL 
AVAQNRWLHiYDNQGlBIiHCXRRCDRVTRLEFrjPFHFLIiATASE 
TGFLTYLDVSVGKIVT^NARAGRLDVMSQNPYKAVIHLGHSNG 
TVSLWSPAMKEPLAKILCHRGGVRAVAVDSTGTYMATSGLDHQL 
KrFDLRGTYQPLSTRTLPHGAGHLAPSQRGLLVAGMGDWNIWA 
GQGKASPPSLEQPYLTHRLSGPVHGLQFCPFEDVLGVGHTGGIT 
SMLVPGAGEPNFDGLESNP YRSRKQRQEWEVKALLEKVPAEL I C 
LDPRAIAEVDVISLEQGKKEQIBRLGYDPQAKAPFQPKPKQKGR 

SSTASLVKRKRKVMDEEHRDKVRQSLQQQHHKEAKAKPTGARPS 
ALDRFVR 


6253 


27 


1897 


BEFCTWIAVHVGEMETAPKPGKDVPPKKDKLQTKRKKPRRYWEE 
ETVPTTAGASPGPPRNKKNRELRPQRPKNAYILKKSRISKKPQV 
PKKPREWKNPESQRGLSGAQDPPPGPAPVPVEWQKFCRIDKSR 
KIiPHSKTUCTRSRIiBVAEABEEETS IKAARSBLLLAEEPGFLEGE 
DGEDTAKICQADIVEAVDIASAAKHPDLNLRQFGPYRLNYSRTQ 
RHIAFGGRRGHVAALDWVTKKLMCEINVMEAVRDIRFLHSEALL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
(A=Alanine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, Phenyl alanine, G=Glycine, 
H=Histidine, I-Isoleucxne, KaLysine, 
LsLeucine, M=Methionine, Ns=Asparagine, 
P=*Proline, Q^Glutaraine, R^Arginine, 
S^Serine, T=Threonine, v= Valine, 
W=Tryptophan, y=Tyrosine, X«»Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








AVAQNRWLH I YDNQGI ELHCI RRCDRVTRLE FLPFHFLLATASE 
TGFLTYLDVSVGKIVAALNARAGRLDVMSQNPYNAVIHLGHSNG 
TV^LWSPAMKEPIiAKILCHRGGVRAVAVDSTGTyt4ATSGLDHQL 
KIFDLRGTYQPLSTRTLPHGAGHLAFSQRGLLVAGMGDWNIWA 
GQGKASPP SLEQP YLTHRLSGPVHGLQFCP FEDVLGVGHTGGIT 
SMLVPGAGEPNFDGLESNPYRSRKQRQBWEVKALLEKVPAELIC 
LDPRALAEVDVI SLEQGKKEQI ERLGY DPQAKAPFQPKPKQKGR 
SSTASLVKRKRKVMDBEHRDKVRQSLQQQHHKEAKAKPTGARPS 
ALDRFVR 


6254 


155 


1139 


HALGRRGGSQELSAAACGCFALRLRAPGSGRPALAPGAAAFAGL 
GGAPRFPPRGSAAGRTMLLKEYRICMPLTVDEYKIGQLYMrSKH 
SHEQSDRGEGVEWQNEPFEDPmiGNGQPTEKRVYLNSKLPSWA 
RAWPKIFYVTEKAWNYYPYTITEYTCSPLPKFSIHIETKYEDN 
KGSNDTI FDNEAKDVE RBVCFI DIACDEI PERYYKESEDPKHFK 
SEKTGRGQLREGWRDSHQPlMCSYKtiVTVKFEVWGLQTRVEQFV 
HKVVRDILLIGHR0ftFAWVnFMVnMTMnn\/01?VPtnsTMlJl?rk»nKTTV 

VCNQHSS PVDDI ESHAQTS T 


6255 


1 


1444 


PTRPQQELLVSIiATVIFVASQKALSVESKAVIKQQLESVSNGWT 
VYRIAROASRMGNHIMAKELYQSLLTQVASKHPyPMLNSliKEFS 
HAEQCLTGLQEENYSSAIiSCIAESLKFYHKGIASLXAASTPLNP 
LSFQCEPVKLRlDLIiOAFSQLlCTCNSr.KTSPPPAIATTIAMTIi 
GNDLQRCGRISNQMKQSMEEFRSLASRYGDLYQASFDADSATLR 
NVELQQQSCIiLISHAIEAIilLDPESASFQEYGSTGTAHADSEYE 
RRMMSVYNHVLEEVESLNGKYTPVSYMHTACLCKAIIALIiKVPL 
SFQRYFFQKLQSTSIKLALSPSPRNPAEPIAVQNNQQLALKVEG 
WQHGSKPGLFRKIQSVCLNVSSTLQSKSGQDYKIPIDNMTNEM 
EQRVEPHNDYFSTQPLLNFAILGTHNITVESSVKDANGIVWKTG 
PRTTIPVKSLEDPySQQIRLQQQOAOOPIjOOOOORNAYTRP 


6256 


1 


1542 


CRGAGAEPAANPRSPRSLVPSLESTSTSVPPAPGTMATDSWAIiA 
VDEQEAAAESLSNLHLKEEKIKPDTNGAWKTNANAEKTDEEEK 
EDRAAQSLLNKLIRSNLVDNTNQVEVLQRDPNSPLYSVKSPESL 

rlkpqllqGvyamgfnrpskiqenalplmlaeppqnliaqsqsg 
tgktaafvlamlsqvepankypqclclgptyeiialqtgkvieqm 

GKFYPELKLAYAVRGNKLERGQKI SEQIVIGTPGTVIiDWCS KLK 
FIDPKKI KVFVLDEADVMI ATQGHQDQS IRIQRML PRNCQMLI.F 
SATPEDSVWKFAQKWPDPNVIKLKREEETLDTIKQYYVIiCSSR 
DEKFQAIiCNLYGAITIAQAMIFCHTRKTASWLAAELSKEGHQVA 
LLSGEMMVEQRAAVIERFREGKBKVLVTTNVCARG IDVEQVS W 
INFDLPVDKDGNPDNETYLHRIGRTGRFGKRGLAVNMVDSKHSM 
NILNRIQEHFNKKIERLDTDDLDEIEKIAN 


6257 


210 


615 


AFIPAMAELIQKKLQGEVEKYQQLQKDLSKSMSQRQKLEAQLTE 

NNIVKEELALLDGSNWFKLLGPVLVKQELGE/iRATVGBCRLDYI 

rAEIKRYESQLRDIiERQSEQQRETLAQLQQEFQRAQAAKAGAPG 
KA 




210 


615 


AFIPAMAELIQKKLQGBVEKYQQLQKDLSKSMSGRQKLEAQLTE" 

NNIVKEELALLDGSNWFKLLGPVLVKQELGEARATVGKRLDYI 

TAEIKRYESQLRDLERQSEQQRBTLAQLQQEFQRAQAAKAGAPG 
KA 


62S9 


2 


1540 


IIiEKGFPSQCHPERKWKVDDVLESSQENEDDHFWELLFHNNKTV 
S VENGDRGSKTFNLGrDP VSLRNYP YK2 CDSCBMNLKMTISGL 1 1 
SKKNCSRKKPDEFNVCEKLLLDIRHEKIPIGEKSYKYJDQKRNAI 
NYHQDLSQPSPGQS FE YSKNGQGPHDETIAFFTNKRSQ IGETVCK 
YNECGRTPIESLKLNISQRPHLEMEPYGCSICGKSPCMNLRFGH 
QRALTKDWPYEYNEYGEIPCDNSAFIIHQGAYTRKILREYKVSD 
KTMEKSALLKHQIVHMGGKSYDYNENGSKFSKKSHLTQLRRAHT 
GEKTFECGECGKTFWEKSNLTQHQRTHTGEKPYECTECGKAFCQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide" 

\*i-rtj.aii.j.ues/ v.yfciteine^ JLi=ASparCxC ACiO., E=; 
Glutamic Acid, F== Phenylalanine, G=Glycine, 
H:=Histidine, I^Isoleucine, Kr^Lysine, 
L=sLeucine, M=Methionine, N»Asparagine , 
P«Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=sThreonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X:=Unknown, *=Stop 
Codon, /=:possible nucleotide deletion, 
\=poBsible nucleotide insertion) 










KPHLTNHQRTHTGEKPYECKQCGKTFCVKSNIjTEHQRTHTGEKP 
YECNACX3KSPCHRSALTVHQRTHTGEKPFICNECGKSFCVKSNL 
IVHQRTWTGEKPYKCNECGKTFCEKSALTKHQRTHTGEKPyECN 
ACGKTFSQRSVLTKHQRIHTRVKALSTS 




6260 


2081 


1436 


GTGPEIHACAHASARAPGSRAMALRELKVCLLGDTGVGKSSIVW 
RFVEDSFDPNINPTTGASFMTKT'VrivnWRT.wvTrT.TMn'rRr'rin'iDi? 

RALAPMYVRGSAAAI I VYDITKESTPSTLKNWVKELRQHGPPN I 
WAIAGNKCDLIOVREVMERDAKDYADSIHAIFVETSAKWAIWI 
NELFIEISRRIPSTDANLPSGGKGFBCLRRQPSEPKRSCC 




6261 


3 


1188 


FWYRLGPGTRSRWPRRGSWAASLVPRGPSPAALVTSPCPPDPIiR 
SPACEPCRPDFAPRPALLLRSGPRSAPAVTGKPALKGQPGPWPG 
MAEVSIDQS KLPGVKEVCRDFAVLEDHTLAHSLQEQE lEHH IAS 
NVQRNRLVQHDLQVAKQLQEEDLKAQAQU3KRYKDLEQQDCEIA 
QEIQEKLAIEAERRRIQEKKDEDIARLLQEKELQEEKKRKKHFP 
EFPATRAYADSYYYEDGGMKPRVMKEAVSTPSRMAHRDQEWYDA 
EIARKLQEEELLATQVDMRAAQVAQDEE I ARLLMAEEKKAYKKA 
KEREKSSLDKRKQDPEWKPKTAKAANSKSKKSDEPHHSKNERPA 
RPPPPIMTDGBDADYTHPTNQQSSTRHFSKSESSHKGFHYKH 




6262 


2 


1759 


PECHSQGIiCSVHRPGKVPQARMSGLVLGQRDEPAGHRLSQEEIJL 
GS TRLVSQG LEALRS EHQ A VLQS TiS QTIE CLQQGGHEEGL VHEK 
ARQLRRSMENIELGLSEAQVMLAIiASHLSTVESEKQKLRAQVRR 
LCQENQWLRDELAGTQQRLQRSEQAVAQLEEEKKHLEPLGQLRQ 
YDEDGHTSBBKEGDATKDSLDDLFPNEEEEDPSN6LSRGQGATA 
AQQGGYEIPARLRTLHNIiVIQYAAQGRYEVAVPLCKQALEDLER 
TSGRGHPDVATMLNILAIiVYRDQNKYKEAAHLIjNDAIjSIRESTL 
QPDHPAVAATLNNIiAVLYGKRGKYKEAEPLCQRAIjEIREKVIjGT 
NHPDVAKQLNNLALLCQNQGKYEAVERYYQRAIiAIYEGQLGPDN 

pnvartknklascylkqgkyaeaetlykeiltrahvqefgsvdd 
DHKP I wmhaeereemsksrhheggtpyaeyggwykackvss ptv 

NTTIJUinjGau:YRRQGKLEAAETLEECyU.RSRRQGTDPISQTKVA 

ellgesdgrrtsqegpgdsvkpeggedasvavewsgdgsgtlqr 
sgslgkirdvlrr 


6263 


1 


2408 


reldsiadlperikppyanglstshlrsssvedvkliisegrpt 
ievrrcsmpsvicehtkqfqtiseesnqgslltvpgdtspspkp 
bvfsnvpbrdlsnvsnihssfatsptgasnskyysadrnl i knt 
apvmtvmdspvhlepssqvgviqnkswempvdrlbtlstrdfic 

PNSNIPDQESSLQSFCWSENKVLKENADFLSLRQTELPGNSCAQ 
DPASFMPPQQPCSPPSQSLSDAESISKHMSLSYVANQEPGILQQ 
KNAVQ 1 1 SSALDTDNE S T KDTENTPVLGDVQKTDAFVP VYSDS T 
IQEASPNFEKAYTLPVLPSBKDFNGSDASTQLNTHYAPSKLTYK 
SSSGHEVENSTTDTQVISHEKENKLESLVLTHLSRCDSDLCEMN 
AGMPKGNLNEQDPKHCPESEKCLLS lEDEESQQS I LSSIiENHSQ 
QSTQPEMHKYGQLVKVELEEWABDDKTENQI PQRMTRNKRNTMA 
NQSKQIIASCTLLSEKDSESSSPRGRIRLTEDDDPQIHHPRKRK 
VSRVPQPVQVSPSLLQAKEKTQQSIAAIVDSLKLDEIQPYSSER 
ANPYPEYLHIRKKIEEKRKLLCSVIPQAPQYYDBYVTFNGSYLL 
DGNPLSKICIPTITPPPSLSDPLKELFRQQEWRMKLRLQHSIE 
REKLIVSNEQE\rbRVHYRAARTLANQTLPFSACTVLLDAEVYNV 
PLDSQSDDSKTSVRDRFNARQFMSWLQDVDDKFDKLKTCXiLMRQ 
QHEAAALNAVQRLEWQLKLQELDPATYKSISIYEIQEFYVPLVD 
VNBDFELTPI 


6264 


143 


1960 


KHRQENNALDMAPEIHMTGPMCLIENTNGELVANPEAliKILSAI 
TQP WWAI VGLYRTGKS YLMNKLZiGKNKGFSLGSTVKSHTKG I 
WMWCVPHPKKPEHTLVLLDTEGLGDVKKGDNQNDS W 1 FTLAVLL 
SSTLVYNSMGTIKTQQAMDQLYYVTELTHRIRSKSSPDENENEDS 
ADFVSFFPDFVWTLRDFSLDIjEADGQPLTPDEYLEYSLKXTQGT 
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S£Q 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=iLysine, 
L=Leucine, M-Methionine, NaAsparagine, 
P-Proline, Q-Glutamine, R^Arginine, 
S=Serine, T=Threonine, V^Valine, 
W»Tryptophan, Y=Tyrosine, X=lXnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








SQKDECNFWLPRLCIHKFFPKKKCFVFDLPIHRRKLAQLEKLQDE 

EJjDPEFVQQVADPCSY r psnsktktlsggi kvngprleslvlty 
IMAISRGDLPCMENAVXALAQlENSAAVQKAIAHyDQQMGQKVQ 
LPAETLQELLDLHRVSEREATEVYMKWrSFKDVDHLPQKiCLAAQL 
DKKRDDFCKQNQEAS SDRCSALLQVI PSPIiEEEVKAG I YSKPGG 
yCLFIQKIiQDLEKKYYEEPRKGIQAEEILQTYLKSKESVTDAIL 
QTDQILTEKEKEIEVECVKAESAQASAKKVEEMQIKYQQMMEEK 
EKSYQEHVKQIiTEKMERERAQLLEBQEKTLTSKLQEQARVLKER 
CQGESTQLONEIQKLQKTLKKKTKRYMSHKLKI 


626S 


143 


1960 


KHRQENNALDMAPEIHMTGPMCLIENTNGELVANPEAIiKILSAI 
TQPVVWAIVGLYRTGKSYLhiNKLAGKNKGPSXiGSTVKSHTKGI 
WM WCVPH P KKP EHTLVLLDTEGLGDVKKGDWQNDSW I FTLAVLL 
SSTLVYNSMGTINQQAMDQLYYVTELTHRIRSKSSPDENENEDS 
ADFVSFFPDFVWTLRDFSLDLEADGQPLTPDEYIiEYSLKLTQGT 
SQKDKNFWLPRLCIRKFPPKKKCFVFDLPIHRRKLAQLEKLQDE 
ELDPEFVQQVADFCSYIFSNSKTKTLSGGIKVNGPRLESLVLTY 
INAISRGDL PCMENAVliAIiAQ I ENSAAVQKAIAHYDQQMGQKVQ 
LPAETLQELLDLHRVSEREATEVYMKNSFKDVDHLFQKKIiAAQIj 
DKKRDDFCKQKQEASSDRCSALIjQVIPSPIjEEEVKAGIYSKPGG 
YCLPI QKIiQDIiEKKY YEEPRKG I OAEEI LQT YIiKSKES VTDAIL 

qtdqiltekekeievecvkaesaqasakktveemqikyqqmmeek 
eksyqehvkqltekmereraqi*leeqektltskxqeqarvi>ker 
cqgestqlqneiqklqktlkkktkrymshklki 




276 


1421 


GSHQKQMLVPCPLYSLQNRKPSLYGSLTCQGIGLDGIPEVTASE 
^it I viNJiXiNK.Jw>iHlSCPKENASSKFIiAPYTTFSRIHTKSITCLD 
ISSRGGLGVSSSTDGTMKIWQASNGELRRVLEGHVPDVNCCRFF 
PSGLWLSGGMDAQLKIWSAEDASCWTPKGHKGGILDTAIVDR 
GRNWSASRDGTARLWDCGRSACLGVUUJCGSSINGVAVGAADN 
S INLGSPEQMpSBREVGTEAKMLLLAREDKKLQCLQl^iS RQLVF 
LFIGSDAFNCCTFLSGFIiLLAGTQDGNIYQLDVRSPRAPVQVIH 

rsgapvi:.sllsvrdgpiasqgdgscfivqqdldyvteltgadcd 
pvykvatwekqiytccrdglvrryqlsdl 


5257 


3 


622 


lgmmkknnsakrgpqdgnqqpappekvgwvrkpcgkgifreiwk ■ " 
nrywlkgdqlyisekevkdekniqevfdlsdyekceelrksks 

ITRAKNRILDEVTVEEDSYIiAHPTRDRAKIQHSRRPPTRGHLMA 
VASTSTSDCMLTIiDLIQEEDPSPEEPTSLC 


6268 


160 


1368 


HRELCQNLPAGLSSALIDNPLTLLLSIDTYVMLQEPVTPQDVAV 
DFSREEWGt.LGPTQRTBYRDVMLETFGHLVSVGWBTTLENKELA 
PNS D I PEEEPAPSLKVQESSRDCALSSTtiEDTLQGGVQE VQDTV 
LKQMESAQEKDLPQKKHFDNRESQANSGAIJ)TKQVSLQKIDNPE 
SQANSGALDTNQVLLHKIPPRKRLRKRDSQVKSMKHWSRVKIHQ 
KSCERQKAKEGNGCRKTFSRSTKQITFIRIHKGSQVCRCSECGK 
IFRNPRYFSVHKKIHTGERPYVCQDCGKGFVQSSSLTQHQRVHS 
GERPPECQECGRTFNDRSAISQHLRTHTGAKPYKCQDCGKAPRQ 
SSHLIRHQRTHTGERPYACNKCGKAFTQSSHLIGHQRTHNRTKR 
ECKKQPTS 


6269 " 


28B6 


1449 


HASAPTRRNMAAASPLRDCHAWKDARLPLSTTSNEACKLFDATL 
TQYVKWT^roKS LGGIEGCLS KLKAADPTFVMGHAMATGLVLIGT 
GSSVKLDKELDLAVKTMVEISRTQPLTRREQLHVSAVETFANGN 
FPKACELWEQILQDHPTDMLALKFSHDAYFYLGYQEQMRDSVAR 
I yPFPra'PDIPl.SS YVKGI YSFGIiMETNPYDQAEKIAKEALSINP 
TDAWSVHTVAHIHEMKAE I KDGLEFMQHSETLWKDSDMLACHNY 
WHWAIiYIiIEKGEYEAALTIYDTHILPSLQANDAMLDWDSCSML 
YRLQMBGVSVGQRWQDVLPVARKHSRDHILLFNDAHFLMASLGA 
HDPQTTQELLTTLRDASESPGENCQHLLARDVGLPLCQALVEAE 
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SEQ 
ID 
NO: 


PirectiGted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide " 
(A=Alanine, C=Cysteine, D=Aspareic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=^Lysine, 
L=:Leucine, M=Methionine, N=AsparagiAe, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V^Valine, 
W=Tryptophan, Y-Tyrosine, X«Unknown, *:=Stop 
Codon, /-possible nucleotide deletion, 
\«possible nucleotide insertion) 


6270 






DBNPDRVLfclbLLPIRYRIVQLGGSNAQRDVFNQLLlHAALNCTS 
SVHKNVARSLLMERDALKPNSPLTERLIRKAATVHLMQ 


6271 


23 


2086 


SVaVTLGSEGDUKPPTyHJLBBMEQLlr^Cfi^GEPAEXKIIREAYKKA 

FLFVNKGLNTDELGQKEEAKNYYXQGIGHLLRGISISSKESEHT 

GPGWESARQMQQKMKETLQNVRTRLEILEKGIATSLQNDLQEVP 

KLYPEFPPKDMCEKIiPEPQSFSSAPQHAEVNGNTSTPSAGAVAA 

PASLSLPSQSCPAEAPPAYTPQAAEGHYTVSYGTDSGEFSSVGE 

EFYRNHSQPPPLETLGLDADELILIPKGVQIFFVNPAGEVSAPS 

YPGYLRlVRFLDNSLDTVLNRPPGFLQVCDWXiYPL VPDRSPVLK 

CTAGAYMFPDTMLQAAGCFVGWLSSELPEDDRELFEDLLRQMS 

DLRLQANWNRAEEENEFQIPGRTRPSSDQLKEASGT0VKQLDQG 

WKDVRHKGKRGKRAKDTSSEEVNLSHIVPCEPVPEEKPKELPEW 

SEKVAHNILSGASWVSWGLVKGAEITGKAIQKGASKLRERIQPB 

EKPVEVSPAVTKGr.YIAKQATGQAAKVSQPLVDBVCTVANCVGK 

EliAPHVKKHGSKLVPESIiKKDKDGKSPriDGAMWAAS S VQGFST 

VWQGLECAAKCrVNNVSABTVQTVRYKYGYNAGEATHHAVDSAV 

KVGVTAYWINNIGIKAMVKKTATQTGHTLLEDYQIVDNSQRENO 
EGA7WJVNVRGE KDE QTKE VKEAKKKDK 


6272 


32 


10^6 


GCGVKTAGMVGREKELSIHFVPGSCRLVBEEVNIPNRRVT.wrTi 
TGLLGRAVHKEFQQNNWHAVGCGFRRARPKPEQVWLLDSNAVHH 
I IflDPQPHVIVHCAAERRPDWENQPDAASQIiNVDASGNLAKEA 
AAVGAFLIYISSDYVFDGTNPPYREEDIPAPLNLYGKTKLDGEK 
AVLENNLGAAVLRIPILYGEVEKIiEESAVTVMFDKVQFSNKSAN 
MDHWQQRFPTHVKDVATVCRQLAEKRMi:jDPSIKGTFHMfiRHI?nM 

TKYEMACAIADAFNLPSSHLRPITDSPVLQAQRPRNAQLDCSKL 
ETLGIGQRTPPRIGIKESLWPFLIDKRWRQTVFH 


6273 


1136 


528 


UAVMBDAAAPGRTBGVLERQGAPPAAGQGGALVBLTPTPGGIJVL 
VSPYHTHRAGDPLDLVAIAEQVQKADSFIRANATNKLTVIAEQI 
QHLQEQARKVLEDAHRDANLHHVACNTIVKKPGNIYYLYKRESGQ 
QYFSIlSPKEWGTSCPHDFLGAYKIiQHDLSWTPYEDIEKQDAKl 
SMMDTIiLSQSVALPPCTEPNFQGLTH 


6274 


256 


843 


bQ_PKVSPECRSIiGCQVMFSLPLNCSPDHIRRGSCWGRPQbl.KIA 
SAAWNSKCHPGAGAAMARQHARTLW YDRPRYVPME FCVEDSTDV 
HVI,IEDHRIVFSCKNADGVELYNEIEFYAKVKSKDSQDKRSSRS 

ITCPVRKMKEKVAWPRLTKEDIKPWLSVDFDNWRDMEGDEEME 
LAHVEHYAEVRDNTYCVLPT 


6275 


56 


1142 

■ 


AftAAMAAAAGGGAGAARSLSRFRGCLAGALLGDCVGSFYEAHDT" 

VDLTSVLRHVQSLEPDPGTPGSERTEALVYTDDTAMARALVQSIi 

LAKEAFDEVDMAHRPAQEYKKDPDRGYGAGVVTVFKKLLNPKCR 

DVFEPARAQFNGXGSYGNGGAMRVAGISLAYSSVQDVQKFARLS 

AQLTHASSLGYKGAILQAIAVHIALQGESSSKHFLKQLLGHMED 

LEGDAQSVUDAREIiGMEERPYSSRLKKIGELIiDQASVTREEWS 

ELGNGIAAFESVPTAI YCPLRCMEPDPSr PSAFNSIK2RTI>I YS I 

SLGGDTDTIATMAGAIAGAYYGMDQVPESWQQSCEGYEETDILA 
QSLHRVFQKS 


6276 


20 


565 

] 
] 


SKKUKARCIAKGaKRPVPRPAKTMAFMVKTMVGGQLKMLlfeSLG 

SGEDKGDGDKSAAEAQGMSREEYEEYQKQLVEEKMERDAQFTQR 

KABRATUlSHFRDKYRLPKNETDESQIQMAGGDVEliPRELAKMI 

SEDTEEEEEKASVLGQLASLPGUffLGSLKDKAQATLGDLKQSAE 
<CHVM 




757 


37 

S 
\ 
I 

,1 


[■JjLPLPPLPDtkgmh>LNTGLEGTVAENPVPIVHTPSGMILTLS " 

SCU5QLATHPGHWGIHLQIAEPAALRPSIALIARLSSLGLLHWP 

WGAKISHGSFSVPGHVAGRELLTAVAEVFPHVI'VAPGWPEEV 

^SGYREQLLTDMLKLCQGLWQPVSFQMQAMLriGHSTAGAIGRL 

xASSPRATVTVEHNPAGGDYASVRTALIiAARAVDRTRVYYRLPO 
YHKDLLAHVGRN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
CO rre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptid~ 
(A=Alanine, C-.CyBteine, D=Aspartic Acid, 
Glutamic Acid, F=. Phenyl alanine, G=Glycine, 
HaHistidine, I=Isoleucine, K=Lysine, 
L^Leucine, M^Methionine, N=Asparagine, 
P-Proline, Q=Glutamine, R=Arginine, 
S=Serine, Ti=Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X»Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6277 


4600 


2744 


MAFRTEMGI^YYSYPKTIVEAPSFLNGVWMIMNDKLTEYPLVINT 
LKRPNLYPEVILASWYRlYTKIMDLIGrOTKICWTVTIGEGIiSP 
TESCEGLGDPACFYVAV3 FI LNGLMMALFFI YGT YLSGSRLGGL 
VTVLCFFFNHGECTRVMWTPPriRESFSYPFLVLQMLLVTHILRA 
TKLYRGSLIALCISNVFFMLPWQPAQFVLLTQIASLPAVYWQY 
IDICKLRKIlYIHMISLALCFVLMFGNSrOjLTSYYASSLVriWG 
ILAMKPHFLKlNVSELSLWVIQGCFWLPGTVILKrYLTSKIFGIA 
NDAHIGNLLTSKFFSYKDFDTLLYTCAAEFDFMEKSTPLRYTICr 
LLLPWIiVGFVAIVRKI I SDMWGVLAKQQTHVRKHQFDHGEL VY 
HALQLLAYTALGILIMRLKL?LTPHMCV^4ASL1CSRQLFGMLFC 
KVHPGAI VFAIIiAAMS I QGSAWLQTQMKriVGEPSNLPQEEL lEW 
I KYSTKPDAVFAGAMPTMAS VXLSALRP I VNHPH YEDAGLRART 
KIVYSMYSRKAAEEVKRELI KLKVKY YILEESWCVRRSKPGCSM 

PEIWDVEDPAMAGKTPLCNLLVKDSKPHPTTVFQNSVYKVLEW 
KE 


6278 


3 


823 


i LFRIiVLLSLVYLLNS VATE ERKPAEVLI VEGQQYAWGT V1,L~ 

IRXILEYCQG VDNX PSVTTDMLTRLSDItLKYFNSRS CQLVL6AG 

ALQWGLKTITTKMIiAL^SSRCLQLIVHYIPVIRAHFEARLPPKQ 

YSMLRHFDHITKDYHDHIAEISAKLVAIMDSLPDKLLSKYEVKA 

PVPSACFRNICKQMTKMHEAIPDLLPEEQTQMLFLRINASYKLH 

LKKQLSHUJVlNDGGPQNGIiVTADVAFYTGNIiQAIiKGIiKDIiDIjN 
MAEIWEQKR 


6279 


127 


1687 


GGAMASDGARKQFWKRSNSKLPGSIQHVYGAQHPPFDPLLHGTL "" 

LRSTAKMPTTPVKAKRVSTFQEFESNTSDAWDAGEDDDELLAMA 

AESLNSEWMETANRVLRNHSQRQGRPTLQEGPGLQQKPRPEAE 

PPSPPSGDLRLVKSVSESHTSCPAESASDAAPLQRSQSLPHSAT 

VTLGGTSDPSTLSSSALSEREASRLDKPKOLLAGPNTDLEELRR 

LSWSGIPKPVRPMTWKTiLSGYLPANVDRRPATLQRKQKSYFAPl 

EHYYDSRNDEVHQDTYRQIHIDIPRMSPEALILQPKVTEIFERI 

LFIWAIRHPASGYVQGINDLVTPFFWPICEYIEAEEVDTVDVS 

GVPAEVLCNIEADTYWCMSKLLDGIQDNYTFAQPGIQMKVKMLE 

EIjVSRIDEQVHRHLDQHEVRYLQFAFRWMNNLLMREVPLRCTIR 

LWDTYQSEPDGFSHFHLYVCAAFliVRWRKEILEEKDPQELLLFL 

QNIiPTAHWDDBDISLLlxAEAYRLKFAPADAPNHYKK 


6280 


857 


251S 


ECCDQKMGSRNSSSAGSGSGDPSEGLPRRGAGLRRSEEEEEEDE 
DVDLAQVLAYLLRRGQVRLVQGCGAANLQFIQALLJDSEEENDRA 
WDGRLGDRYNPPVDATPDTRELEFNEIKTQVELATGQLGLRRAA 
QKHSFPRMLHQRERGLCHRQSPSLGEQSRVISHFLPNDLGFTDS 
YSQKAFCGIYSECDGQIFMSACQDQTIRLYDCRYGRFRKFKSIKA 
RDVGWSVLDVAFTPDGNHFLYSSWSDYIHICNIYGEGDTHTALD 
LRPDERRFAVPSIAVSSDGREVLGGANDGCLYVFDREQKRRTLQ 
lESHEDDVNAVAPADlS SQ ILFSGGDDAICKVlflDRRTMREDDPK 
PVGAIiAGHQDGITFIDSKGDARYLISNSKDQTIKI>WDIRRFSSR 
EGMEASRQAATQONWDYRWOOVPKKAWR Kr.If "L PRn*? C T .mt vpr'u 

GVLHTLIRCRFSPIHSTGQQFIYSGCSTGKVWYDLLSGHXVKK 
LTCIHKACVRDVSWHPPEEKIVSSSWDGNLRLWQYRQAEYFQDDM 
PESEECASAPAPVPQSSTPFSSPQ 


6281 


857 


2515 


ECCDQKMGSRWSSSAGSGSGDPSEGLPRRGAGLRRSEEBEEEDE 
DVDLAQVLAYLLRRGQVRLVQGGGAANLQFIQALLDSEEEITORA 
WDGRLGDRYNPPVDATPDTRELEFNEIKTQVBLATGQLGLRRAA 
QKHSPPRMLHQRERGLCHRGSFSliCEQSRVISHFLPNDLGFTDS 
YSQKAFCGIYSKDGQIFMSACQDQTIRLYDCRYGRFRKFKSIKA 
RDVGWSVLDVAFTPDGNHFIjYSSWSDYIHICNiyGEGDTHTALD 
LRPDERRFAVFSIAVSSDGREVLGGANDGCLWPDREQNRRTLQ 
lESHEDDVKAVAPADISSQILFSGGDDAICKVWDRRTMREDDPK 
PVGALAGHQDGITP2 DSKGDARY LISJSTSKDQTI KLWDIRRFSSR 
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SEQ 
ID 

viU : 


Predxcted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acra segment containing signal peptide 
{A=Alanine, C^Cysteine, D=Asnartic Aniri t? 
Glutamic Acid, P=Phenylalaniiie, G=Glycine, 
H=Histidine, I«Isoleucine, K=Lysine, 
L-Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, X^Unknown *=<?<-r^r, 
Codon, /=po3sible nucleotide deletion, 
\=^possible nucleotide insertion) 


6282 






EGMSASRQAAi-UUKWDYRWUUVPKKRWRKLKLPCJDSSLMTYRQH 
GVLHTLIRCRFSPIHSTGQQFiySGCSTGKWVyDLLSGHIVKK 
LTNHKACVRDVSWHPFEEKIVSSSWDGKLRLWQYRQAEYFQDDM 
PESEECASAPAPVPQSSTPFSSPQ 


6283 


125 


906 


RMAACRALiKAVLVDLSGTLHIEDAAVPGAQEALkRI.RGAq VT tp ' - 

PVTNTTKESKQDLLERLRKLEFDISEDEIFTSLTAARSLLERKQ 

VRPMLLVDDRALPDPKGIQTSDPWAWMGLAPEHFHYQILNQAF 

RLLLDGAPLIAIHKARYYKRKDGLALGPGPFVTALEYATDTKAT 

WGKPEKTFFLEALRGTGCEPEEAVMIGDDCRDDVGGAQDVGML 

GILVKTGKYRASDEEKINPPPYLTCESFPHAVDHILOHLL 


6284 


140 


1043 


i.ax,FGIHVM«Ft.-wSMSTSSVRKki>i!iaHi;;KTLTGDVKTSPPRTAP"" 

KKQLPSIPKNALPITKPTSPAPAAQSTNGTHASYGPFYLEYSLL 

AEFTLWKQKLPGVYVQPSYRSALMWFGVIPIRHGLYQDGVFKF 

TVYIPDWYPDGDCPRLVFDIPVFHPLVDPTSGBIiDVKRAFAKWR 

RKHNHIWQVLMYARRVFYKIDTASPLNPEAAVLYEKDIQLFKSK 

WDSVKVCTARLFDQPKIEDPYAISFSPWNPSVHDEAREKMLTO 

KKKPEEQHNKSVHVAGLSWVKPGSVOPFSKEEKTVAT 


6285 


1 


2879 


RSVIPGariSSRWPGLSRPRFMAAHEWDWFQRKKI^IGQISDIRV 
QNLQVERENVQKRTFTRWINLHIiEKCNPPLEVKDLFVDIQDGKI 
LMALLEVLSGRNLLHEYKSSSHRIFRIiNNIAKALKFLEDSNVKL 
VSIDAAEIADGNPSLVLGLIWNIILFFQIKELTGNLSHNSPSSS 
LAPGSGGTDSDSSFPPTPTAERSVAISVKDQRKAIKftLLAWVQR 
KTRKYGVAVQDFAGSWRSGLAFLAVIKAIDPSLVDMKQALEWST 
RENLEKAPSIAQDALHIPRLLEPEDIMVDTPDEQSIMTYVAQFL 
ERFPELEAEDIFDSDKEVPIESTFVRIKBTPSEQESKVFVLTEN 
GERTYTVNHETSHPPPSKVFVCDKPESMKEPRLDGVSSHALSDS 
^ * V JJU^uirufi. i oU J. SBPSPESS ILSSRKENGRSNS 
LPIKKTVHFEA0TYKDPFCSKNLSLCFEGSPRVAKESLRQDGHV 
LAVEVAEEKEQKQESSKIPESSSDKVAGDIFLVEGTNNNSQSSS 
CIIGALESTARHDEESHSLSPPGBNTVMADSFQIKVNIWV 
EGDYFEAXPLKASKFNSDLIDPASTSQAPNKVPSPHETKPDEDA 
EAFENHAEKLGKRSrKSAHKKKDSPEPQVKMDKHEPHQDSGEEA 
BGCPSAPEETPVDKlCPEVHEKAKRKSTRPHirSEEGEDDDLQGVG 
BELSSSPPSSCVSLETLGSHSEEGLDPKPqpPT.QTr^r<^\7TDtTT^TT3 

YPPHYEVPLAAVLEAYVEDPEDLKNEEMDLEBPEGYMPDLDSREI 
EBADGSQSSSSSSVPGESLPSASDQVLYLSRGGVGTTPASEPAP 
LAPHEDHQQRETKENDPMDSHQSOESPNLENIANPLEENVTKES 
ISSKKKEKRKHVDHVESSLFVAPGSVQSSDDLEEDSSDYSIPSR 
TSHSDSSIYLRRHTHRSSESDHFSLCSVEERSRSG 


€286 


2157 


1331 


SCKTENLLBMwWFgQGLSFLPSALVIWTSAAFIFSYirAVTLHlI " 
IDPALPYI SDTGTVAPEKCLFGA^5LNa:AAVLCIATI YVRYKQVH 
ALSPEENVI IKLNKAGLVLGILSCLGLS I VANFQKTTLFAAHVS 
GAVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCG 
VSALSMLTCSSVLHSGNFGTDLEQKLHWNPEDKGYVLHMTTAA 
aniDi joior f ^ri? iji I iRUFQKISLRVEANLHGLTLYDTAPCPTM 
NERTRLLSRDI 




1619 


276 

( 

1 
\ 
I 

i; 


KAI:»AaCCG5AMPYVSVGKSCVLLAMAQLQTRFYTDMKKYAVDDV"~ 
PFSIPAASEIADLSNIINKLLKDKNEFHKHVEFDFLIKGQFLRM 
PLDKHMEMENISSEBWEIEYVEKYTAPQPEQCMFHDDWISSIK 
SAESWILTGSYDKTSRIWSLEGKSIMTIVGHTDWKDVAWVKKD 

slsclllsasmdqtillwewnvernkvkaluccrghagsvdsia 
/dgsgtkfcsgswdkmlkiwstvptdeedemesstnrprkkqkt 

SQLGLTRTPIVTLSGHMEAVSSVLWSDABBlCSASWDHTrRVWD 
rSSGSLKSTLTGNKVFWCISYSPLCKRLASGSTDRHlRLWDPRT 
CDGSLVSLSLTSHTGWVTSVKWSPTHEQQLISGSLDNlVKIiWDT 
ISCKAPLYDLAAHBDKYLSVDWTDTGLLLSGGADNKLYSYRYSP 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=.Alanine, C=.Cysteine, D«Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Ha.stidine, l=Isoleucine, K=i>ysine, 
tisLeucine, M=Methionine, N=Agparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
Se=Serine, T=Threonine, V= Valine, 
W=:Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
X^possible nucleotide insertion) 
TTSHVGA ' 


6287 
6288 


278 

^ 


1482 


MQFFFNFQlGbRSTSGKEKXSGDAGFLGDALQLPLQCiALDEDF 

APAKLQVQKILCDLLLPENLKEGLKESSWSSLPCTKNfRPPDFHS 

VMEESQSLNEPSPKQSEEIPEVTSEPVKGSI.NRAQSAQSINSTE 

MPAREDCLKRVSSEPVLSVQBKGVLLKRKLSLLEQDVIVKEDGR 

NKLKKQGETPNEVCMFSLAYGDIPEEIilDVSDFECSLCMRLFFE 

pvrrPCGHSFCKNCLERCLDHAPYCPLCKESLKEYIiADRRYCVT 

QLLEELIVKyLPDELSERKKIYDEETAELSHLTKNVPlPVCTMA 

yPTVPCPLHVFEPRYRLMlRRSIQTGTKQPGMCVSDTQNSFADY 

GCMLQIRNVHPLPDGRSWDTVGGKRFRVLKRGMKDGYCTADIE 
YLEDV 


6289 


1 


743 


V i f iJKUbVGNLLLGASGMASGCKlGPSIUISDLANLGAECLR 
MLDSQADYLHIjDVMDGHFVPNITFGHPWESLRKQLGQDPFFDM 
HMMVSKPEQWVKPMAVAGANQYTFHLEATENPGALI KDIRENGM • 

kvglaikpgtsveylapwanqidmalvmtvepgfggqkfmedmm 

PK^/HWI^TQFPSLDIBVDGGVGPDTVHKCAEAGANMIVSGSAIM 
RS EDPRSVINLLEJNVCSEAAQKRSLDR 


6290 


1 


743 


VTLYPCRGljVGNIiLGASGMASGCKIGPSILNSDIiANLGAECLR 
MLDSGADYLHLDVMDGHFVPNITPGHPWESLRKOLGOOPFFDM 

hmmvskpeqwvkpmavaganqytfhleatenpoalikdirrngm 

ECVGIAIKPGTSVEYIAPWANQIDPIALVMTVEPGFGGQKPMEDMM 

pkvhwlrtqfpsldibvdggvgpdtvhkcaeaganmivsgsaim 

RSEDPRSVINriljRNVCSEAAQKRSLDR 


6291 


3 


1856 


Tl^RWLUSVYETVAPTLACLPRPRLRRRRRRRRRRMISRYTRKA 
VPQSLELKGITiCHALNHHPPPEKLEEtSPTSDSHEKDTSSQSKS 
DITRESSFTSADrGNSLSAPPSYlUAGISTEGSSDFSWGYGELD 
QNATBKVQTMFTAIDEIiLYEQKLSVHTKSLQBECQQWTASFPHL 
RILGRQIITPSEGYRLYPRSPSAVSASYETTIiSQERDSTIFGIR 
GKKLHFSSS YAHKASS IAKSSSFCSMERDEEDS I IVSEGI lEEY 
LAPDHIDIEEGFHGKKSBAATEKQKLGyppIAPFYCMKEDVLAY 

vfdsvwckwscmeqltrshwegfasddesnvavtrpdsesscv 

LSELHPLVLPRVPQSKVLYITSNPMSI,CQASRHQPNVNDLLVHG 

mplqprnlslmdklldlddkllmrpgsstilstrnwpnravefs 

TSSLSYTVQSTRRRNPPPRTLHPISTSHSCAETPRSVEEILRGA 

Rvpvapdslsspsptplsrnnllppigtaevehvstvgpqrqmk 

PHGDSSRAQSAWDEPNYQQPQERLLIiPDFFPRPWTTQSFLLDT 

qyrrscaveyphqarpgrgsagpqlhgstksqsggrpvsrtrqg 


6292 


1732 


602 


LVAKMASSASARTPAGKRVINQEELRRLMKEKQRLSrSRKRIES 

ppakynrlgqlscalcntpvksellwqthvlgkqhrekvaelkg 

AKEASQGSSASSAPQSVKRKAPDADDQDVKRAKATLVPQVQPST 

sawttnfdkigkefiratpskpsglsllpdyedeeeeeeeeegd 
gerkrcdaskplsdaqgkehsvsssrevtssvripndffstnppk 
apiiphsgsiekaeihekwerrentaealpegffddpbvdarv 

»*-v V n^^^i jwjviiwijj-,r ujuiWKyvjN i. iSEAXVAEEDEEGRIiDRO 

IGEIDEQIBCYRRVEKLRNRQDEIKNiCLKEILTIKELQKKEEBN 
ADSDDBGELQDLLSQDWRVKGALL 


6293 


1835 


1142 


rcPGAMKMVAPWTRPYSNSCCLCCHVRTGTILLGVWYLlINAW 

LLILLSALADPDQYNFSSSELGGDFEFMDDANMCIAIAISLLMI 

IiICAMATYGAYKQRAAWI IPFFCYQIFDFALNMLVAITVLI YPN 

SIQEYIRQLPPNFPYRDDVMSVNPTCLVLIILLFISIILTFKGY 

LISCVMNCYRYINGRMSSDVLVYVTSNDTTVLLPPYDDATVNGA 
flKEPPPPYVSA 




2382 


1035 i 

y 
I 


FWCTLGTVDVHPIGWCAINSKILVPPRTIHAKFTDWKGYLMKRI* 
/GSRTLPVDFHIKMVESMKYPFRQGMRLEWDKSQVSRTRMAW 
JTVIGGRLRLLYEDGDSDDDFWCHMWSPLIHPVGWSRRVGHGIK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acxd segment containing signal peptide 
CA=Alanine, C«Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G:=Glycine, 
H=:Histidine, I=Isoleucine, K=I*ysine, 
L=Leucine, M-Methionine, N^Asparagine , 
P-Proline, Q=Glutamine, R^Arcrinine 
S=serine, T-Threonine, V=Valine, 
W=Tryptophan, Y=.Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=poseible nucleotide insertion) 








MSERRSDMAHHPTFRKIYCDAVPYLFKKVRAVYTEGGWFKEGMK 
LEAIDPI^LGNICVATVCKVIiLDGYLM I CVDGGP S TDGLDWPC Y 

HASSHAIFPATFCQKNDIBLTPPKGYEAQTPNWENYLEKTKSKA 

APSRLFNMDCPNHGFKVGMIOiEAVDLMEPRLICVATVKRVVHRl, 

LStHFDGWDSEYDQWVDCESPDIYPVGWCELTGYQLQPPVAAEP 

ATPLKAKEATKKKKKQFGKKRKRIPPTKTRPLRCX3SKKPLLEDD 

PQGARKISSEPVPGEIIAVRVKEEHLDVASPDKASSPELPVSVE 
NIKQETDD 


6394 


354 


1814 


AQXiTTRCRTVAGGVRWIPSPPPDLELYSCCLGTDkGFPELSHHC" 

KNVIATASDYDMAEITNIRPSFDVSPWAGLIGASVLWCVSVT 

VFVWSCCHQQABKKHKNPPYKFIHMLKGISIYPETLSNKKKIIK 

VRRDKDGPGREGGRRNLLVDAAEAGLLSRDKDPRGPSSGSCIDQ 

LPIKMDYGEELRSPITSLTPGESKTTSPSSPSEDVMLGSLTFSV 

DYNFPKKALWTIQEAHGLPVMDDC3TCK3SDPYIKMTILPDKRHR 

VKTRVLRKTLDPVFDETPTFYGIPYSQLQDIiVLHPLVLSFDRFS 

RDDVIGE VMVPIiAGVDPSTGKVQLTRDI I KRNIQKCISRGELQV 

SLSYQPVAQRMTW/LKARHLQKMDXAGLSGNPYVKVNVYYGRK 

RIAKKKTHVKKCTLNPIFNESFIYDIPTJDLLPDISIEFLVIDFD 

RTTKNEWGRLILGAHSVTASGAEHWREVCESPRKPVAKWHSLS 
EY 


6295 


2795 


S17 


VSSALLTGATSGSDAAKSEGASASPLSCTNAVAMDRPDEGIJPAK 

TRRLSSSESPQRDPPPPPppppLLRLPLPPPQQRPRLQEETEAA 

QVLADMRGVGLGPALPPPPPYVILEEGGIRAYFrLGAECPGWDS 

TIESGYGEAPPPTESLEALPTPEASGGSLEIDPQWQSSSPGGE 

u/ujri I (-s/vvtjWAfyKXjVDPKSKBEAI XIVBDBDEDERESMRSSR 

RRRRRRRRKQRKVKRESRERNAERMESILQALEDIQLDLEAVNI 

KAGKAFLRLKRKFIQMRRPFIiERRDLIIQHIPGFWVKAFLNHPR 

ISILINRRDEDIFRYLTNliQVQDLRHISMGYKMKLYFQTNPYFT 
NMVIVKEF0RtIRSGRIiVSHQTPTT?MWT5r:nr'Tm»nTiiT/->«M/-vi-in ^.tt^ 

FFSWFSNHSLPEADRIAEIIKN0LWVNPLRYYLRERGSRIKRKK 
QBMKKRKTRGRCEWIMEDAPDYYAVEDIFSBISDIDETIHDIK 
ISDFMETTDYFETTDNEITDINENICDSENPDHMEVPNNETTDN 
NESADDHETTDNNESADDNNENPEDNWKimJDNEENPNNNENrY 
GNNFFKGGFWGSHGNNQDSSDSDNEADEASDDEDNDGNEGDNEG 
SDDDGNEGDNEGSDDDDRDIE YYEKVI EDPDKDOADYEDVl E 1 1 
SDESVBEEGIEEGIQQDEDIYEEGNYEEEGSEDVWEBGEDSDDS 
DLEDVLQVPNGWANPGKRGKTG 


6296 
6297 


727 


1199 


RHCGCmQGACDSLPPTCTSSPVTARNAiPEARCCVWLLDG'TTV 
EAVRPARERIoARKEIiRQKRMQQFSRDSAYSSNKDSTCLLTERDT 
LGTSLQFPSPPSGTISFGSPSDSiSIFPUSSQCCLGPQQPSlSGK 
KWALIHKRVRLSVFGARWGRIYFGK 


6298 


1 


922 


QRAAAASPSSCGPRGAEYGALMAMEGYWRPLALLGSAIiLVGFLS 
VI FALVWVliHYREGLGWDGSALEFNWHPVLMVTGFVFI QGIAI I 
VYRLPWTWKCSKLLMKSIHAGLNAVAAILAIISWAVFENHNVK 
NIANMYSLHSWVGLIAVICYLLQLLSGPSVFLXiPWAPLSLRAFL 
MPIHVYSGIVIFGTVIATALMGLTEKLIFSLRDPAYSTPPPEGV 
FVNTLGLLILVFGALIFWIVTRPQWKRPKEPNSTILHPNGGTEQ 
GARGSMPAYSGNNMDKSDSSLNNEVAARKRNLALDEAGQRSTM 




3 


985 

1 

] 
] 


SVPI^RRLSLSGTLQGAGTTTKMAVARraAAVAAWVPCRSWGWAflV 
PFGPHRGLSVLIARIPQRAPRWLPACRQKTSLSFLNJ?PDLPNLA 
yiCKLKGKSPGIIFIPGYLSYMNGTKAriAIEEFCKSLGHACIRFD 
IfSGVGSSDGNSEESTLGKWRKDVLSIIDDLADGPQILVGSSLGG 
WLMLHAAIARPEKWALIGVATAADTLVTKFNQLPVELKKEVEM 
KGVWSMPSKYSEEGVYNVQYSFIKEAEHHCHiHSPIPVNCPIRL 
t^HGMKDDIVPWHTSMQVADRVLSTDVDVILRKHSDHRMREKADI 
JLLVYTIDDLIDKLSTIVN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acxd segment containing signal peptide 
(A»Alanine, C-Cysteine, D-Aspartic Acid, 
Glutamic Acid, P^Phenylalanine, G=tGlycine, 
H=Histidine, I=lsoleucine, KsLysine, 
L«Leucine, M=Methionine, N=Asparagine , 
ir— rioxmc/ w=^iucamine, R=Arginine, 
S=:Serine» T^Threonine, V^valine, 
W==Tryptophan, Y^Tyrosihe, X=Unknown, *-Stop 
v.(juu*i, / =possa.i>j.e nucieotiae aeletion, 
\=possible nucleotide insertion) 


6299 


512 


814 


ia\-ujji!,u±i»'ii'iy V i IjtJjijbi' iWta6i'LiOUlLViili*CVTSLDSAILTSS" " 

SIDAMDDSAFSGPYKFPFTPPLESPNLCFYTSQVPVPPILGFYQ 

MKEEEVQLRKNH 


6300 


121 


692 


AAPSCWSQKGVPAAGTPSSPRLLVl^RAAAPSAGPWGAWRQGARA 
AOSPPSIPNSSSVPYGSQDSVHSSPSDGISGGRDRPVGGSPGGPR 
LVIGSLPAHLSPHMFGGFKCPVCSKFVSSDEMDLHLVMCLTKPR 

ITYNEDVJl,SKDAGECAICLSEIiQQGDTIARLPCX.CiyHKGCIDE 
WFEVNRSCPEHPSD 


6301 


616 


284 


GKFVPVNWEPPQPbPFPKYIiRCYRCLLETKBLGGLLGSIi'i'eLTp 
RGSSCI TLHKKKSSGSDVMVSDCRSKEQMSDCSNTRTS PVSGPW 
IFSQYCPLD FCNDPQNRGLYTP 


6302 
■ 63 03 


490 


14S 


I FGFLHLPHMEHSPLLVCALPAHVFFS SSCGSS VALHSDPCLLS 
PVLLNCrjPGDLRPLDBLYAQKLECYKAISEELDHALNDMTSL 




2 


1951 


YWNEYGGGLI.WQSWQEKHPGQAI»SSBPWNFPDTKEEWEQHYSQL 
YWYYLEQFQYWEAQGMTFDASQSCDTDTYTSKTEADDKNDEKCM 
KVDLVS PLSS P IMGDNDSSGTSDKDHSEILDGI SNI KLNSEEVr 
QSQLDSCTSHDGHQQLSEVSSKRECPASGQSEPRNGGTNEESNS 
SGNTWTDPPAEDSQKSSGANTSKDRPHASGTDGDESEEDPPEHK 
PSKLKRSHELDIDENPASDPDDSGSLLGPKYGSGQKYGGIPNFS 
HRQVRYliEKNVKLKSKYLDMRRQIKMKNKHIFFTKESEKPFFKK 
SKILSKVEKFLTWVNKPMDEEASQESSSHDNGHDASTSCDSEEQ 
DMSVKKGDDLLETNNPEPEKCQSVSSAGELETEKYERDSLIiATV 
PDEODCVTQEVPDSRQAETEAEVKKKKNKKKNKKVNGLP PE I AA 
VPELAKYWAQRYRLFSRFDDGIKLDREGWFSVTPEKIAEHIAGH 
VSQSPKCDWVDAFCGVGGNTIQFALTGMR VI AIDI DPVK lAIiA 
RNNAEVYGIADKIEPICGDPLLLASFLKADWPLSPPW3GPDYA 
TAETFDIRTMMS PDGPEI FRLSKKITNNI VYFLPRNADIDQVAS 
LAGPGGQVB lEQNFIiNNKLKTI TAYFGDLIRRPASET 


6304 


1 


1438 


HRARVDRSRES PGGDLRHPGRVRRD I TLSGHPRLSrQH VVLLRE 
DEVGDPGTKDIiGHPQHGSPIQETQSEVVTLVSPLPGSDMAALPA 
WRATSGLTLWPHTAEGRDLLGAENRALTGGQQAEDPTLASGAYQ 
WPSSVEKLQGSWJCDAETLLSSSRTGGQAPPMLTDHDVQMIiRLL 
AQGEWDKARVPAHGQVLQVGPSTEAALQDLSSPRLSQLCSQGL 
CGLIKRPGDLPEVLSPHVDRVLGLRRSLPAVARRFHSPLLPYRY 
TDGGARPVIWWAPDVQHLSDPDSDQNSLALGWLQYOAIjLAHSCN 
WPGQAPCPGIHHTEWARLALFDPLLQVHDRIiDRYcCGFBPEPSD 
PCVEERUREKCRNPAELRLVHILVRSSDPSHLVYIDNAGEn.QHP 

edklnprllegidgf?esavicviasgci:*qnmj:.lkslqmdpvpwe 
sqggaqglkqvlqtleqrgqvllghlqkhniitlfrdedp 


6305 


93 


420 


imi WRGRS T YRPRPRRS VPPPBLIGPMLEPGDEEPQQEEPPTES 
RDPAPGQEREEDQGAAETQVPDLEADLQELSQSKTGDECGDGPD 
VQGKILTKSEQPKMPEGR 


6306 


1 


1874 


J?TRPSKVKV?HTFLIHSYTRPTVCQACKKLLKGLFRQGLQCKDC 
KFNCHKRC ATR VPNDCLGEAli 1 NGDVPMir pa "mPCR a n v c a t lur^ 

ESSDSGVI PSSHSENALHASEEEEGEGGKAQS SLGYI PLMRWQ 
SVRHTTRKSSTTLREGWWHYSNKDTLRKRHYWRLDCKCITLFQ 
NNTTNRVYKEIPLSEILTVESAQWFSLVPPGTNPHCFEIVTAKA 
TYFVGEMPGGTPGGPSGQGAEAARGWETAIRQALMPVILQDAPS 
APOHAPHRQASLSISVSNSQIQEMVDIATVYQIFPDBVLGSGQP 
GWYGGKHRKTGRDVAVKVIDKLRPPTKQESQLRNEVAIIiQSLR 
HPGIVNLECMFETPEKVFWMEKLHGDMLEMILSSEKGRLPERL 
TKFLrTQILVALRHLHFKNIVHCDLKPENVI.tASADPPPQVKLC 
DFGFARlIGEKSFRRSWGTPAYLAPEVIiLNQGYNRSLDMWSVG 
VIMYVSLSGTPPFNEDEDXNDQIQNAAFMYPASPWSHISAGAID 
LINNLLQVKMRKRYSVDKSLSHPWLQBYQTVOjDLRELEGKMGER 
YITHESDDARWEQFAAEHPLPGSGLPTDRDTbGGACPPQDHDMQG 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


; Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing cignal peptide 
(A=Alanine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P= Phenyl alanine, G=^Glycine, 
H=Histidine, I-Isoleucine, K-Lysine, 
L=Leucine, M^Methionine, N=Asparagine , 
P^Proline, Q=Gluceimine, R=;Arginine, 
S-Serine, T-Threonine, V= Valine, 
W=rryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
IiAERISVL " _ . 


6307 


2136 


589 


CFLLPRGRDPEPPEAGAAAPCAPGAPDMSFRKVVRQSKFRHVFG 
QPVKNE>QCyEDIRVSRVTWDSTFCAVNPKFLAVIVEASGGGAFL 
VLPliSKTGRIDKAyPTVCGHTGPVLDIDWCPHNDEVIASGSEDC 
TVMVWQIPENGLTSPLTEPWVLEGHTKRVGIIAWHPTARNVLL 
SAGCDNWriZWNVGTABEriYRLDSLHPDLIYNVSWNHNGSLFCS 
ACKDKS VR I IDPRRGTLVAEREKAHEGARPMRAX FLADGKVFTT 
GFSRMSERQLALWDPENLEEPMALQELDSSNGALLPFYDPDTSV 
VYVCGKGDSSIRYFEITEEPPYIHFLNTFTSKEPQRGMGSMPKR 
GLEVSKCEIARFYKLHERKCEPIVMTVPRKSDLFQDDLYPDTAG 
PEAALEAEEWVSGRnADPlLISLREAYVPSKQRDliKISRRNVIiS 
DSRPAMAPGSSHLGAPASTTTAADATPSGSLARAGEAGKtiEEVM 
QELRALRALVKEQGDRICRLEEQLGRMENGDA 


6308 


2 


1118 


GRPTRPBKMLLSLVLHTYSMRYXjLPSWLLGTAPTYVLAWGVSffR 
LLSAFLPARFYQALDDRLYCVYQSMVLFFFENYTGVQILLYGDL 
PKNKENI lYLAWHQSTVDWI VADILAIRQNALGHVRYVLKEGUC 
WLPLYGWYFAQHGGIYVKRSAKFNEKEMRNKLQSYVDAGTPMYL 
VIFPEGTRYNPEOTKVIiS ASQAFAAQRGLAVLiCHVLTPRI KATH 
VAFDCMKNYLDAIYDVTWYEGKDDGGQRRESPTMTEFIiCKECP 
KIHIHIDRIDKKDVPEEQEHMRRWLHERPEIKDKMLIEFYESPD 
PERRKRFPGKS VWSKLS I KKTLPSML ILSGLTAGM jMTDAGRKL 
YVNTW I YGTLLGCLWVTI KA 


6309 


220 


563 


l.vabvkbpcsiapmlsvdmenkengsvgvkl^s^aengrppdpadwa ' 
vmdwnyfrtvgfeeqasafqeqeidgkslllmtrndvltglql 

KLGPALKtYEYHVKPU3TKHIiKNNSS 


6310 


36 


979 


GPRCWKPLiILSSVNCETLRIGKAWPQ^SeSQBRYWTPRTHSSASS 

AQRGSLA3LNVAAAGLWADCDQPLYDCPMCGLICT1TYHILQEHV 

DLHLEENSFQQGMDRVQCSGDLQIxAIIQLQQBEDRKRRSEESRQE 

1EEFQKLQRQYGLDN3GGYKQQQLRNMEIEVNRGRMPPSEFHRR 

KADMMESLALOPDDGKTKTSGI r EALEIRYYQNAATDVRRVWLSS 

WDHFHSSLGDKGWGCGYRNFQMriLSSLLQNDAYNDCLKGMLIP 

CIPKIQSMIEDAWKEGFDPQGASQLIIRLQGTKAWIGACEVYIL 
LTSLRV 


6311 


1 


675 


PVWWNSCEGPKLAAAARTGHGVGRRARLACUGEPRVKAAVMLTL 
ASKLKRDDGDKGSRTAATASDSTRRVSVRDKLLVKEVAELEANL 
PCTCKVHPPDPNKLHCFQLTVTPDEGYYQGGKFQFETEVPDAYW 
MVPPKVKCLTKIWHPNITETGEICLSLLREHSIDGTGWAPTRTL 
KDWWGIiNSLFTDLUTFDDPIiNIEAABHHLRDKEDFRNXVDDyi 
KRYAR 


6312 


213 


1400 


GDELVKREAGMKMLPGVGVPGTGSSARVLVPLLRABGFTVEALW 
GKTEEEAKQLAEEMNIAFYTSRTDDILLHQDVDLVCISIPPPLT 
RQISVKALGIGKNWCEKAATSTOAFRMVTASRYYPQLMSLVGN 
VLRFLPAFVRMKQLI SEHYVGAVMI CDAR2 YSGSLLS PS YGWI C 
DEIjMGGGGLHTMGTYIVDLLTHLTGRRAEKVHGLLKTFVRQNAA 

irgirhvtsddfcffqmlmgggvcstvtiinfnmpgapvhevmvv 
gsagrlvargadlygqknsatqeelllrdslavgaglpeqgpqd 
vpllylkgmvymvqalrqspqgqgdrrtwdrtpvsmaasfedgl 
vmqswdaikrssrsgeweavevlteepdtnqnlcealqrnnl 


6313 


2 


2071 


QRSGAARLAFLPS PFS PACVHRSPLSFHGCWF YFWVFMPLGVL" " " 

fhrrrahgctlscssfveqptameaeetmeclqbfpehhkmild 
rlneqreqdrftditlivdghhfkahkavlaacskffykffqef 
tqeplveiegvsk^1afrhlieftytaklmiqgeeeandvwkaae 

FLQMLEAIKALEVRNKENSAPLEENTTGKNEAKKRKIAETSNVI 
TESLPSAESEPVEIEVEIAEGTIEVEDEGIETLEEVASAKQSVK 
YIQSTGSSDDSALALLADITSKYRQGDRKGQIKEDGCPSDPTSK 
□VEGIEIVELQLSHVKDLFHCBKCNRSFKLPYHFKEHMKSHSTE 
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SEQ 
ID 
NO: 


Predictod 

beginning 

nucleotide 

location 

c or re sponding 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 

nucleotide 

location 

c orr e s ponding 

to first 

amino acid 

residue of 

amino acid . 

sequence 


Ammo acid segment containing signal peptide 
(A^Alanine, C=Cyateine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=^Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L^Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
SsSerine» TsThreonine, V=sValine, 
W^Tryptophan, Y«Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








SFKCEICNKRYLRESAWKQHLNCYHLEEGGVSKKQRTGKKIHVC ■ 
QYCEKQFDHFGHFKEHLRKHTGEKPFSCPNCHERFARWSTLKCH 
LTACQTGVGAKKGRKKLYECQVCNSVFNSWDQPKDHLVIHTGDK 
PNHCTLCDLW FMQGNELRRHLSDAHNI SERLVTEEVLS VETRVQ 
TEPVTSMTI I EQVGKVHVLPLLQVQVDSAQVTVEQVHPDLLQDS 
QVHDSHMSELPEQVQVSYLEVGRIQTEEGTEVHVEELHVERVNQ 
MPVEVQTELLEAPLDHVTPEIMNQEERESSQADAAEAAREDPIED 
AEDLETKPTVDSEAEKAENEDRTALPVLE 


6314 


2 


2071 


QRSGAARLAFLPSPFSPACVHRSPLSFHGCWFYFWVFMPIiGVL 
FHRRRAHGCTLSCSSFVEQPTAMEAEETMECLQEPPEHHKMILD 
RLNEQREQDRFTDITLlVDGHHFKAHKAVliAACSKPPYKFFQEF 

TOEPliVETPdVCTTMSPUXTT TtrTTTVPRirT iuTYr(r»T?'i:iT?7vvit^»rf»Tj>ii « i-i 
*=>j.*iV3 voru"jrtj:«rnjj.ar J. x l/U^ijMiyCjhJiEANDVWKAAB 

PLQMLEAIKALEVRNKENSAPIiEENTTGKNEAKKRKIAETSNVI 

TESLPSAESEPVEIEVEIAEGTIEVEDEGIETLEEVASAKQSVK 

YIQSTGSSDDSALALLADITSKYRQGDRKGQIKEDGCPSDPTSK 

QVEGIEIVELQLSHVKDLFHCEKCNRSFKLFYHFKEHMKSHSTE 

SFKCEICNKRYLRESAWKQHLNCYHLEEGGVSKKQRTGKKIHVC 

QYCEKQFDHFGHFKEHLRKHTGEKPFECPNCHERFARNSTLKCH 

LTACQTGVGAKKGRKKLYECQVCNSVFNSWDQPKDHLVIHTGDK 

PNHCTLCDLWFMQGNELRRHLSDAHKf I S ERTjVTFP VT , d^TOTTJ^jn 

TEPVTSMTIIEQVGKVHVLPLIjQVQVDSAQVTVBQVHPDLIiQDS 
QVHDSHMSELPEQVQVSYLEVGRIQTEEGTBVHVEELHVERVNQ 
MPVEVQTELI^ ADLDHVTPE IMNQEERESSQADAAEAAREDHED 
AEDLETKPTVDSEAEKAENEDRTAIiPVIiS 


6315 


1 


1015 


LGLAVNWTTLVLI S YCPTATEEAPY WTYLLCALGLFI YQSLDA 
IDGKOARRTNSCSPLGELFDHGCDSIjSTVFMAVGASIAARLGTY 
PDWFFSCSFIGMFVFYCAHWQTYVSGMLRPGKVDVTBIQIALVI 
VFVI^APGQATMWDYTIPILEIKLKILPVLGFLGQVIFSCSNYF 
HVIIiHGGVQKNGSTIAGTSVLSPGLHIGLXIILAIMIYKKSATD 
VPEKHPCLYILMFGCVFAKVSQKLWAHMTKSELYLQDTVFLGP 
GLLFLDQYPNNFIDEYWLMMAMVISSFDMVIYPSAIjCLQISRH 
LHLNI FKTACHQAPEQVQVLSSKSHQNWMD 


€316 


1SG3 


792 


VSAGAGTGlMGGTTSTRRVrFEADENENITWI^6±RLSENVIDR 

mkesspsgsksqrysgaygasvsdeelkrrvaeelai^bqakkes 

EDQKRX^KOAICELDRERAAANEQLTRAILRERICSEEERAKAKHL 

arqleekdrvlkkqdafykeqiarleerssefyrvtteqyqkaa 
eeveakfkryeshpvcadlqakilqcyrekthqtlkcsalatqy 
mhcvnhakqsmlekgg 


6317 


102 


839 


psaqtsavlarekghlpimrheapmqmasaqdarygqkdssdqu 

FDYMPKLLX IGNS S VG KTS PLFRYADDS PTSAPVST VGIDFKVK 

tvfknekripclqiwdtagqeryrtittayyrgamgfilmyditn 
EESFNAVQDWSTQ I KT yswdnaqvilvgnkcdmedervisterg 
QHIiGEQLGPEFFETSAKDtTINVKQTFERLVDIICDECMSESLETD 
PAI TAAKQNTRLKETPPPPQPNCAC 


6318 




733 


PWHPLRTIiPLHHPHPRPPRAEGREGADSMSHLPGLELRREAPPL 
LGPLLSPFPLPAGSt«IRQMLRSSLRFPITNSAGAPCKAAGHMNI 
LAPVRRDRVLAELPQCLRKEAALHGHKDPHPRVTCACQEHRTGT 
VGFKISKVIWGDLSVGKTCLINRFCKDTFDKNYKATIGVDPEM 
ERFEVLGI PFSLQLWDTAGQBRFKCIASTYYRGAQAI 1 1 VFNLN 
DVASLEHTKQWLADALKENDPSSVLLFLVGSKKDLSTPAQYALM 
EKDALQVAQEMKAEYWAVSSLTGENVREFFFRVAALTFEANVIA 
ELEKSGARRIGDWRINSDDSNLYLTASKKKPTCCP 


6315 


88 


717 


AATMRLNQNTLLLGKKWLVPYTSEHVPSRYttEWMKSEELQRLT " 
ASBPLTLEQEYAMQCSWQEDADKCTFIVLDAEKWQAQPGATEES 
CMVGDVNIiFLTDLEDLTLGEIEVMIAEPSCRGKGLGTEAVLAWL 
SYGVTTLGLTKFEAKIGQGNEPSIRMFQKLHPEQVATSSVFQEV 



485 



wo 01/53312 



PCT/USOO/34263 



SBQ 
ID 
NO: 


Preolicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=:Aspartic Acid, B= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
Hs>Histidine, I=rsoleucine, K=Lysine, 
L=sLeucine, M=Methionine, N=Asparagine, 
P^Proline, Q=Glutamine, R=Arginine, 
S=Serine, T-Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








TLRLTVSESEHQWLLEQTSHVEBKPYRDGSAEPC 


6320 


90 


1111 


RPRTGREKVAMAAVDSFYLLYREIARSCNCYMEALALVGAWYTA 
RKSITVICDFYSLIRLHFIPJRLGSRADLIKQYGRWAWSGATDG 
IGKAYAEEIiASRGLNI ILrSRNEEKLQWAKDIADTYKVETDII 
VADPSSGREIYLPIREALKDKDVGILVNNVGVFYPYPQYFTQLS 
EDKLWD I INVNIAAASLMVHWLPGMVERKKGAI VTI SSGSCCK 
PTPQLAAFSASKAYLDHFSRALQYEYASKGI EVQSIiIPF YVATS 
MTAPSNFLHRCSWLVPSPKVYAHHAVSTLGISKRTTGYWSHSIQ 
FLFAQYMPEWLWVWGAWILNRSLRKEALSCTA 


6321 


1418 


341 


HRKAALGALMAGRLLGKALAAVSLSLAIASVTIRSSRCRGIQAP 
RNS FSSS WFHLNTNVMSGSNGSKENSHNKARTS PYPGSKVERSQ 
VPNEKVGWLVEWQDYKPVEYTAVSVLAGPRWADPQISESNPSPK 
FNEKDGHVERKSKNGLYEIBNGRPRNPT^SRTGLVGRGIiLGRWGP 
NHAADPIITRWJCRDSSGWKIMHPVSGKHILQFVAIKRKDCGEWA 
I PGGMVDPGE KI SATLKREFGEEALNSIiOKTSAEKRE I EE KliHK 
LFS QDHLVI YKG YVDDPRNTDNAWMETEAVNYHDETGE IMDNLM 
liEAGDDAGKVKWVDINDKLKLYASHSQPlKLVAEKRDAHWSEDS 
EADCHAL 


6322 


2047 


1083 


NOE I liKNVES S RTVOPHPT iP T7T.T .<3 T .(^M Q vn VfiT? w OP w'rr'xri? oni a 
WSIWCCXIDGEGSQQEEVISSEXUGASIFNGQKICVLYyADAIiTBI 
AFWPSPVESLTDSLESNlSDQDSDSNMDXjMPGILKQPSliTLEL 
FPNHTDNIiNSSQRLS PSS RMRKLPQGRPVPPLGPBTRVSWW VE 
RYDDIENFPLSELMTEISTGVETTAHSSTSLRSTTLEKEVPVIF 
IHPLNTGLPRIKIQGATGKFNMVIPLVDGMIVSRRALGFLVRQT 
VIWICRRKRLESDSYSPPHVRRKQKITDIVWKYRNKQLBPEPYT 
SLFQEVGLKNCSS 


6323 


1 


656 


PASTTDQAQEARVPLDGAFWIPRPPAGSPKGCFACVSKPPALQA 
PAAPAPEPS AS PPMAPTLFPMESKSS KTDSVRAAGAPPACKHLiA 
EKKTMTNPTTVIEVYPDTTEVNDYYLWSIFNFVYIiNFCCIiGFIA 
LAYSIiKVRDKKLLNDIiNGAVEDAKTDRLINITRSGLAASCIMLW 
MAIiS VIATHRGLRSSAS ILVAEPHDWWrERPQVTFRERCPAL 


6324 


1 


20^1 


EGAGMRRCPCRGSLNEAEAGALPAAARMGLEAPRGGRRRQPGQQ 
RPGPGAGAPAGRPEGGGPWARTEGSSLHSEPERAGLGPAPGTES 
PQAEFWTDGQTEPAAAGLGVETERPKQKTEPDRSSLRTHDBWSW 
SEIiGTTCLWTETGTDGLWTDPHRSDIiQFQPBBASPWTQPGVHQP 
WTELETHGSQTQPERVKSWADNLWTHQNSSSLQTHPEQACPSKE 
PSADGSWKELYTDGSRTOQDIEGPWTEPYTDGSQKKQDTEAARK 
QPGTG6FQIQQDTDGSWTQPSTDGSQTAPGTDCLLGEPEDGPLE 
EPEPGELIiTHLYSHLKCSPLCPVPRLI I TPETPEPEAQPVGPPS 
RVEGGSGGPSSASSFDESEDDWAGGGGASDPEDRSGSKPWKKL 
KTVLKYSPFWSPRKHYPWQLSGHAGNFQAGBDGRILKRPCQC 
EQRSLEQLMKDPIiRPPVPAYYQMVLQDQQTFWQMEDLLADFEGP 
SIMDCKMGSRTYIiEEEIiVKARERPRPRKDMYEKMVAVDPGAPTP 
EEHAQGAVTKPRYMQWRETMSSTSTLGFRIEGIKKADGTCN-ITJF 
KKTQALEQVTKVIiEDFVCGDHVIIiQKYVACriEELREALEISPPF 
KTHBWGSSLLFVHDHTGIiAKVWMIDFGKTVALPDHQTLSHRLP 
WAEGNREDGYLWGLDNMICLLQGIAQS 


6325 


165 


944 


GLRDP FRRKRRLKPQVJKMSU YVKDM WPGSPQEKDS PS TSRSGGS 
SRLSSRSRSRSFSRSSRSHSRVSSRPSSRSRRSKSRSRSRRRHQ 
RKYRRYSRSYSRSRSRSRSRRYRERRYGFTRRYYRSPSRYRSRS 
RSRSRSRGRSYCGRAYAIARGQRYYGFGRTVYPEEHSRWRDRSR 
TRSRSRTPFRLSEKDRMELLEIAKTNAAKAIiGTTNIDLPASLRT 
VPSAKETSRGIGVSSKTGAKPEVSILGLSEQNFQKANCQI 


6326 


238 


680 


GEPSPATQQKPSATGAGVLHQHFSSGHIYVLMGLLPPPWTISFT 
VQTTLQPPGGLPAAPVSGRMAFEPVGRDLARRMVPRAGKRTQTL 
GARRVAAQGARPLPEDRRPKSGERLHVTVAPCWEFVLPSVSLTA 



4S6 



wo 01/53312 



PCTAJSOO/34263 



SKQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sec[uence 


Amino acid segment containing signal peptide 
{A«Alanine, C^Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=I*ysine, 
Ls= Leucine, M=Methionine, N^Asparagine, 
P^Proline, Q«Glut amine, R=Arginine, 
SssSerine, T=^Threonine , V=Valine, 
Wt=Tryptophan, Y=Tyrosine, X^Unknown, *sStop 
Codon, /^possible nucleotide deletion, 

QAWGGVGQEASSGVP " 


6327 


1 


1337 


SLARIiAPAGGSWMPTQQPAAPSTRAPKPSRSLSGSLCALFSDA 

DSGSGMKAELPPGPGAVGREMTKEEKLQLRKEKKQQKKKRKEEK 

GAEPETGSAVSAAQCQGPTRELPESGIQLGTPREKVPAGRSKAE 

LRAERRAKQEAERALKQARKGEQGGPPPKASPSTAGETPSGVKR 

LPEYPQVDDLLLRRLVKKPERQQVPTRKDYGSECVSLFSHLPQYS 

RQNSLTQFMSIPSSVIHPAMVRIiGLQYSQGLVRGSWARClALLR 

ALQQVIQDYTTPPNEELSRDLVNKLKPYMSFLTQCRPLSASMHN 

AIKFLNKEITSVGSSKREEEAKSELRAAIDRYVQEKIVLAAQAX 

SRPAYQKISNGDVILVYGCSSLVSRILQEAWTEGRRFRVWVDS 

RPWLEGRHTUiSLVHAGVPASYI^IPAASrVLPEVSTESKDSKV 
GGBKV 


6328 


1030 


276 


HASAEVTa-AAARGLGAMEBEMHTDAKIRAENGrGSSPRGPGCSL 
RHFACEQNLLSRPDGSASFLQGDTSVLAGVYGPAEVKVSKEIFJI 
KATLEVILRPKIGLPGVAEKSRERLIRNTCEAWLGTLHPRTSI 
TWU^WSDAGSLLACCIJNAACMALVDAGVPMRALFCGVACALD 
SDGTLVLDPTSKQEKEARAVLTFALDSVERKLLMSSTKGLYSDT 
ELQQCLAAAQAASQHVFRFYRESLQRRYSKS 


B223 


3 


2016 


SSEVAAGGGTRSAMAEQSGEWTVSATGAANGLNNGAGGTSATT 

SNPLSRKLHKILETRLDNDKEMLEALKALSTFFVENSLRTRRNL 

RGDIERKSIiAINEEFVSIFKEVKEELESISEDVX3AMSNCCX3DMT 

SRXiQAAKBQTQDLIVKTTKLQSESQKLEIRAQVADAFLSKFQLT 

SDEMSLLRGTREGPITEDFFKALGRVKQIHNDVKVLLRTOQQTA 

GLEIMEQMALLQETAYERLYRWAQSECRTLTQESCDVSPVLTQA 

MEALQDRPVLYKYTLDEFGTARRSTWRGFIDALTRGGPGGTPR 

PIEMHSHDPLRYVGDMLAWLHQATASEKEHLEALLKHVTTQGVE 

ENIQEWGHZTEQVCRPLKVRIBQVXVaEPGAVLLYKISNLLKP 

YHHTISGIVGNSATALLTTIEEMHIiI>SKKIFFNSt.SLHA5KLMD 

KVELPPPDLGPSSALNQTLMLLREVIiASHDSSWPLDARQADFV 

QVLSCVLDPLLQMCTVSASNLGTADMATFMVNSnYMMKTTLALF 

E PTDRRLEMLQFQIEAHLDTLINEQAS YVLTRVGLS Y I YWTVQQ 

HKPEQGSLANMPNLDSVTLKAAMVQFDRYLSAPDNLLIPQLNFL 

xjoAx vAiivi vj^yoiiSijvt-KAiUlivxAAVMWPINEYKDPBNIIjH 

SPQQVQTLLS 


6330 


llSl 


333 


FFYYTFYENKTFSRKMVAEKETLSIjNKCPDKMPKRTKLLAQQPI. 
PVHQPHSLVSEGFTVKAMMKNSWRGPPAAGAFKERPTKPTAFR 
KFYERGDFPIALEHDSKGNKIAWKVEIEKUDYHHYLPLFFDGLC 
EMTFPYEFFARQGIHDMLEHGGNKIDPVLPQLIIPIKNALWLRN 
RQVXCVTLKVLQHLWSAEMVGKALVPYYRQILPVLNIPKNMNV 
HiaijiJ\3XiJ I c^s^AKiMM xLrlJijJLUlL 1 Jj SAFER YGGENAFINIKYWP 
TYESCLLN 


6331 


3 


495 


QQGQRVRTRGRRACASATPXjBGCVDLSYPRTHAALLKVAQMVTL 
LIAFICVRSSLWTNYSAYSYFEWTICDLIMILAFYLVHLFRFY 
RVLTCISWPLSELLHYLIGTLLLLIASIVAASKSYNQSGLVAGA 
I FGFMATFLCMASIWLS YKISCVTQSTDTWVV 


6332 


1 


878 


.VTESNKFDLVSFIPLLRERIYSNNQYARQPIISWILVLfiSVPDI 

nlldylpeildglpqilgdngkeirkmcewlgeflkeikknps 
svkpaemanilvihcqttddliqltamcwmrefiqlagrvmlpy 

SSGILTAVLPCLAYDDRKKSIKEVANVCNQSLMKLVTPEDDELD 
ELRPGQRQAEPTPDDALPKQSGTASGEWTPSLHIiTSCRGPREPD 

vigvalgphlsnqdyfmyvthtivaatqrsgssgsppfcrqdtg 
klstmathsqlvktgtgleprqavss sh 


S333 


3 


1467 


trtpseaeaggespqscvsaahsdwtagkpvsllaplipprsag 
qpltfspsgrqplrsrilvgmcsgsgrrrsslsptmrpgtgaerg 
glmmghpgmhyapmgmhpmgqrakmppvphgmmpqmmppmggpp 
mgqmpgmmssvmpgkwmshmsqasmqpaiippgvwsmdvaagtas 



487 
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SEQ 
ID 
NO: 


Predicted 

beginaingr 

nucleotide 

location 

cor re spondi ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding- 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide" 
(A=Alanine, C=Cysteine, D^Aspartic Acid,' E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine , 
P=Prollne, Q^Glutamine, R=Arginine, 
S=Serine, TsThreonine , V=Valine, 
W«Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








GAKSMWTEHKSPDGRTyYYNTETKQSTWEKPDDIiKTPAEQLLSK 
CPWKSYKSDSGKPYYYWSQTKESRWAKPKELEDLEGYQNTIVAG 
SLITKSNIiHAMIKAEESSKQEECTTTSTAPVPTTEIPTTMSTMA 
i AAEAAAAWAAAAAAAAAAAAANANASTSASNTVSGTVPWPEP 
EVTS IVATWDNENTVTI STEEQAQLTSTPAIQDQSVEVSSNTG 
EETSKQETVADFTPKKEBEESQPAKKTYTWNTKEEAKQAFKELL 
KBKRVPSNASWEQAMKMI INDPRy$AIiAKLSEKKQAPNAYKVOT 
EKK 


6334 


- 17 


644 


GGNPSGRAAGFAAAAMPSSPLRVAWCSSNQWRSMEAHNILSKR " 

GFSVRSPGTGTHVFCLPGPAPDKPNVYDFKTTYDQMyNDLLRKDK 

ELYTQNGILHMIjDRNKRIKPRPBRFQNCKDLFDLILTCEERVYD 

QWEDLNSREQETCQPVHWNVDIQDNHEEATLGAFLICELCQC 

IQHTEDMENEIDELLQEFEEKSGRTFLHTVCFY 


6335 


82 


S29 


AARARPOVLCCRIjIliGAALGDQSRVEMSYIPGQPVTAWQRVEIH 
KLRQGBNLILGFSXGGGIDQDPSQUPFSEDKTDKGIYVTRVSEG 
GPAE lAGIiQIGDKIMQVWGWDMTMVTHDQARKRLTKRSSEWRL 
LVTRQSLQKAVQQSMLS 


6336 


1003 


438 


HEPASKGRAfiVGNMRLSVAAAISHGRVFRRMGUSPESRIHLLRN 
LLTGLVRHERIEAPWARVDEMRGYAEKLIDYGKLGDTKERAMRM 
ADPWLTEKDLIPKLPQVLAPRYKDQTGGYTRMLQIPNRSLDRAK 
MAVIEYKGNCLPPLPLPRRDSHIiTLIiNQIiLQGLRQDLRQSQET^ 
NHSSHTAQTPGI 


6337 


76 


524 


EGIQMLSVQPDTKPKGCAGCNRKIKDRYLLKAXJDKYWHEDCLICC"' 
ACCDCRLGEVGS TLYTKANL I LCRRDYLRLFGVTGNTCAACS KLI 
PAPEMVMRAKDNVYHLDCPACQLCnSTQRFCVGDKFFLKKNMILCQ 
TDYBEGLMKEGYAPQVR 


6338 


66 


1349 


APNSESGTQGPLPTPANLFWTRRANPDPTTSMSATDRMGPKAVP 
GIiRIALLriLLGLGTPKSGVQGQEGLDFPEYDGVDRVINVNAKNY 
KtWKKYEVIALLYHEPPEDDKASQRQFEMEELILEIiAAQVIiED 
KGVGPGLVDSEKDAAVAKKLGLTEVDSMYVFKGDEVIEYDGEFS 
ADTI VEFLLDVLEDPVELIEGERELQAFENI EDE I KLIGY PKS K 
DSEHYKAFEDAAEEFHPYI PFFATFDSKGAKKLTLKLNE I DFYE 
AFMEEP VTI PDKPNSEEE IVNFVEEHRRSTLRKLKPESMYErWE 
DDMDGIHI VAFAEEADPDGPEFLETLKAVAQDNTENPDLS I XWI 
DPDDFPI/LVPYWEKTFDIDLSAPQIGWNVTDADRnWMEMDDEE 
DLPSAEELEDWLEDVLEGE INTEDDDDDDDD 


6339 


246 


1813 


NRCDRGGGGQAERQAGQG CRTQGAGPGFGFGHS PFS QGAMKAFH 
TFCWXLVFGSVSEAKFDDFEDEEDIVEYUCNDFAEFEDVMEDS 
VTESPQRVriTEDDEDETTVELEGQDENQEGDFEnADTQEGDTE 
S EP YDDEEFEGYEDKPDTSB S3CNKDPITI VDVPAHLQNSWES YY 
LEILMVTGLLAY IMNYI IGKNKNSRLAQAWFNTHRELLBSNFTL 
VGDDGTMKEATSTGKLNQENEHIYNLWCSGRVCCEQMLIQLRFI. 
KRQDLLNVLARMMRPV3DQVQI KVTMNDEDMDTYVFAVGTRKAli 
VRLQKEMQDLSEFCSDKPKSGAKYGLPDSLAILSEMGEVTDGMI-I 
DTKMVHFLTHYADKIESVHFSDQFSGPKIMQEEGQPLKLPDTKR 
TLLLTFNVPGSGNTYPKDMEALLPLMNMVIYSIDKAKKP^LNRE 
GKQKADKWRARVEENPLKLTHVQRQEAAQSRREEKKRAEKBRIM 
KEEDPEKQRRLEEAALRREQKKLEKKQMKMKQIKVKAM 


^340 


2 


5B3 


EACAHTIiSCPAFARLGRARRRPWMSHRTSSTFRAERS FHSSSSS 
SSSSTSSSASRALPAQDPPMEKAIiSMFSDDFGSFMRPHSEPLAF 
PARPGGAGKIKTLGDAYEFAVD VRDFSPEDI I VTTSNNHIEVRA 
EKLAADGTVMNNFAHKCQLPEDVDPTSVTSALREDGSLTIRARR 
HPHTEHVQQTFRTEIKI 


6341 


2 


6:45 


KMAVLSAPGLRGFRILGLRSSVGPAVQARGVHQSVATDGPSSTQ 
PALPKARAVAPKPSSRGEYWAKLDDLVHWARRSSLWPMTFGIA 
CCAVEMMHMAAPRYDMDRFGWPRASPRQSDVMIVAGTBTNKMA 
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SEQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
cor re spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal peptldi 
lA-Alanine, C=Cysteine, D-Aepartic Acid, E= 
Glutamic Acid, Fo Phenyl alanine, G=Glycine 
H=Hietidine, I-Isoleucine, K^Lysine, 
L«Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S= Serine, T=:Threonine , VsValine, 
W^Trypcophan, Y«Tyrosine, X-CJnkAown, ♦-Stop 
^uuuLi, /=possmj.e nucleotide deletion, 
\wpossible nucleotide insertion) 




5342 






ir'Ai^i^v YDQMPSPRYWSMGSCANGGGli J^HYSYSWRGCDRIVP 
VDIYIPGCPPTAEALLyGILQLQRKIKRERRLOTWYPP 




6343 


2 


1191 


iJt^KVKflMlATlAROAALRKlVbFaURGGGRGLWTCRPQSDMNNI 
KPLEGVKILDLTRVLAGPFATMNLGDLGAEVIKVERPGAGDDTR 
TWGPPFVGTESTYYLSVNRNKKS lAVNIKDPKGVKI IKEIAAVC 

DVFVENYVPGKLSAMGLGYEDIDEIAPHIIYCSITGYGQTGPIS 
QRAGYHAVASAVSGLMHITGPEVACLSHIAANYLIGQKEAKRWG 
TAHGSIVPYQAFKTKDGYIWGAGNNQQPATVCKILDLPELIDN 
SKYKTNHIiRVHNRKELIKILSERFEEELTSKWLYLFEGSGVPYG 
PINNMKNVFAEPQVLHNGLVMEMEHPTVGKISVPGPAVRYSKFK 
^ MSEARPPPLLGQHTTHILKEVLRYDDRAIGELT.QflrzTnmnop^ 




6344 


2 


33 6 


GTAMVSDEuii;j.NU.VIWDANPIWWGKQALKESQFTLSKCIDAV 
MVLGNSHLFMNRSNKLAVIASHIQESRPLYPGKNGRLGDPPGDP 
GNPPEFNPSGSKDGKYELLTSANEVIVEEIKDLMTKSDIKGQHT 
ETLIAGSIAKALCYIHRMNKEVKDNQEMKSRILVIKAAEJDSALQ 
YMNFMNVIFAAQKQNILIDACVLDSDSGLLQQACDITGGIiYLKV 
PQMPSLLQYIJL.WVFLPDQDQRSQLILPPPVHVDYRAACFCHRNL 
IBIGYVCSVCLSIFCNPSPICTTCErAFKISLPPVLKAKKKKLK 




S345 


2508 


14 7 


•liylV^l■A•i^U3NIJtGYG^^SPGIJC^SLTPPQIATPNLC^^ 

QSLLGPPPVGVPIWPSQFNLSGRNPQKQARTSSSTTPNRKDSSS 

QTMPVBDKSDPPEGSEEAAEPRMDTPEDQDLPPCPEDIAKEKRT 

PAPEPEPCEASELPAKRLRSSEEPTEKBPPGQLQVKAQPOARWr 

VPKQTQTPDLLPEALEAQVLPRFQPRVLQVQAQVQSQTQPRIPS 

TDTQVQPKLQKQAQTQTSPEHLVLQQKQVQPQLQQ3AEPQKQVQ 

PQVQPQAHSQGPRQVQLQQEABPLKQVQPQVQPQAHSQPPRQVQ 

LOLQKQVQTQTYPQVHTQAQPSVQPQEHPPAQVSVQPPEQTHEQ 

PHTQPQVSLLAPEQTPVWHVCGLEMPPDAVEAGGGMEKTLPEP 

VGTQVSMEEIQNESACGLDVGECENRAREMPGVWGAGGSLKVTI 

LQSSDSRAFSTVPLTPVPRPSDSVSSTPAATSTPSKQALQFFCY 

iCKASCSSQQEPQDHMSEPQHQQRLGElQHMSQACLLSLIiPVPR 

DVLETEDEEPPPRRWCNTCQLYYMGDLIQHRRTQDHKIAKQSLR 

PFCTVCWRYFKTPRKFVEHVKSQGHKDKAKELKSJbEKEIAGQDE 

DHFITVDAVGCFEGDEEEEBDDEDEEEIEVEEELCKQVRSRDIS 

REEWKGSETYSPWTAYGVDFLVPVMGYICRICHKFYHSNSGAQL 

SHCKSLGHFENLQKYKAAKNPSPTTRPVSRRCAINARNALTALF 

TSSGRPPSQPNTQDKTPgKVTTVRPSQPPLPRRSTRLKT 








3483 

: 

r 

r 


ir'KVKTKLILLVJ^JDKKRYERVGGQPKRLQRDVKMEEMIEQIiQElar- 

HELEKQNDTUCNRLISAKQQLQTQGYRQTPYNNVQSRINTGRRK 

ANENAGLQECPRKGIKFQDADVAETPHPMFTKYGNSLLEEARGE 

IRNLEKTVIQSQRGQIEELEHIAEILKTQLRRKENEIELSLXKSliR 

SQQATDQRSNIRDKVEMIKLHKQLVEKSWALSAMEGKFIQLQEK 

QRTLKISHDALMANGDELNMQLKEQRLKCCSLEKQLHSMKFSER 

RIEELQDRINDLBKBREUCKENYDKLYDSAFSAAHEEQWKLKEQ 

QLKVQIAQLETALKSDLTDKTEILDRLKTERDQNEKLVQENREL 

QLQYLEQKQQUJELiCKRIKLyNQSNDINADBLSEAIiLLIKAQKE 

QKNGDLSPLVKVDSEINKDLERSMRELQATHAETVQELBKTRNM 

blMQHKIWKDYQMBVBAVTRKMENLQQDYELKVEQYVHLLDIRA 

^RIHKLEAQLKDIAYGTKQYKPKPEIMPDDSVDEFDETIHLHRG 

EHLPEIHINKVTFSSEVLOASGDKEPVTFCTYAFYDFELQTTPV 

/RGLHPEYNFTSQYLVHVNDLPLQYIQKNTITLEVHQAYSTEYE 

PIAAOQLKFHEILEKSGRIPCTASLIGTKGDIPNPGTVEYMFRL 

WPMDQAIRLYRERAKALGYITSNFKGPEHMQSLSQQAPKTAOL 

;STDSTDGNLNELHITIRCCaraLQSRASHU3PHPYWYKFFDPA 

)HDTAIIPSSKDPQFDDHMYFPVPMNMDLDRYLKSBSI,SFYVFD 

)SDrQENiyi6iCVNVPLISriAHDRCrSGIPELTmQKHPAGTIH 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A«Alanine, C«Cysteine, D-Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H-Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=spos3ible nucleotide insertion) 








VHjKWKFAYLPPSGSITTEDIiGKFIRSEEPEWQRLPPASSVST 
LVLAPRPKPRQRLTPVDKKVS FVD IMPHQSDVSQEGSVDEVKEN 
TEKMQQGKDDVSLr,SEGQLAEQSLASSEDETEITEDLEPEVEED 
MSASDSDDCIIPGPISKNIKQPSEKiERIEIIALSLNDSQVTMDD 
TIQRLFVECRPYSLPAEBTPVSLPKPKSGQWVYYNYSNVIYVDK 
ENNKAKRDI LKAI LQKQEMPNRSLRPTWSDP PEDEQDLE CEDI 
GVAH VDLADMFQBGRDL I BQNIDVPDAHADGEGXGlCLRVTVBAL 
HALQSVYKQYRDDLEA 


6346 


2S21 


533 


QDRRIiLRLELQKTCQPTSTMSGSHTPACXSPFSAlbTPS I WPQE IL 
AKYTQKEESAEQPEFYYDEFGPRVYKEEGDEPGSSLLANSPLME 
DAPQRLRWQAHLEFTHNHDVGDLTWDKIAVSLPRSEKLRSLVLA 
GI PHGMRPQLWMRLSGALQKKRNSELS YREIVKNSSNDET I AAK 
QIEKDLLRTMPSNACFASMGSIGVPRIiRRVLRALAWtiYPEIGYC 
QGTGMVAACLLLPLEEEDAPWMMSAI 1 BDLLPAS YFSTTLLGVQ 
TDQRVIiRHT i T Vrt VT . PT? T.nVT.T.riPwnTgT.cT ttt umitt TUB^Aoinr 

DIKLIiLRlWDLFFYEGSRVLFQLTIiGMLHLKEEELIQSENSASI 
FWTLSDIPSQMEDAELLLGVAMRLAGSLTDVAVETQRRKHLAYL 
lADQGQLI/SAGTLTNLSQWRRRTQRRKSriTALLFGEDDLEAL 
KAKN I KQTELVADLREAILRVARHPOCTDPKNCS WSRQLPGLL 
PNTALTPPTPLVGLYSLWQELTPDYSMBSHQRDHENYVACSRSH 
RRRAKALLDFERHDDDEIiGPRKNDI I TI VSQKDEHCWVGEI»NGL 
RQWFPAKFVEVLDERSKEYSIAGDDSVTEGVTDLVRGTLCPALK 
ALFEHGLKKPSLLGGACHPWLFIEEAAGREVERDFASVYSRLVIj 
CKTFRLDEDGKVLTPEELLYRAVQSVWVTHDAVHAQMDVKLRSL 
ICVGLNEQVLHLWLE VLCSSLPTVEKWYQPWSPLRS PGWVQ I KC 

elrvlccfafslsqdwelpakreaqqplkegvrdmlvkhhlpsw 

DVDG 


6347 


2921 


533 


qdrruirlbl^ktcqptstmsashtpacgpfsaltpsiwpqeii, 
akytqkebsaeqpefyydefgfrvykeegdepgssllansplme 
dapqrlrwqahlefthnhdvgdltwdkiavslprsekiirslvla 
giphgpirpqlwmrlsgalqkkrnselsyreivknsswdetiaak 
qiekdllrtmpsnacfasmgsigvprlrrvlraiawlypeigyc 

QGTGMVAACLLLFLEEEDAFWMMSAI I EDLLPASYPSTTLLGVQ 
TD OR VLRHLI VO YL PRLDKLLORHDT RT. «5 T . T TT .Bw t?t a w a o\n 7 

DI KLLLRI WDLFPYBGSRVLFQLTIiGMLHLKEEELIQSENSAS I 
FNTLSDI PS®*4EDABLLXiGVAMRLAGSLTDVAVETQRRKHIiAYL 
lADQGQLLGAGTLTNLSQWRRRTQRRKSTITALLFGEDDIiEAL 
KAKNIKQTELVADLREAILRVARHFQCTDPKNCSWSRQLPGLL 
PNTALTPPTPrjVGLYSLWQELTPDYSMESHQRDHBNYVACSRSH 
RRRAKALIjDFERHDDDBLGPRKND X ITI VSQKDEHCWVGELNGL 
RGWPPAKFVEVLDERSKE YS lAGDDSVTEGVTDLVRGTLCPALK 
ALPEHGLKKPSLLGGACHPWLPrEEAAGREVERDFASVYSRLVti 
CKTFRLDEDGICVLTPEELrjYRAVQSVNVTHDAVHAQMDVKLRSL 
I CVGriNEQVLHLWXjEVLCSSLPTVSKW YQPWSFLRS PGWVQIKC 
EliRVLCCFAFSLSQDWEIiPAKREAQQPLKEGVRDMLVKHHLFSW 
DVDG 


6348 


3 


3679 


AGAEKCFVTLLACFLAKQQNKYKYEECKDLIKSMiiRWELQPKEE 
KIiAEQLKQAEELRQYKVLVHSQERELTQLREKLREGRDASRSLN 
EHLQALLTPDE PDKS QGQDLQEQLAEGCRLAQHLVQKLS PENDN 
DDDEDVQVEVAEKVQKSSSPREMQKAEEKEVPEDSLEECAITCS 
NSHGPCDSNQPHKNIKITFEEDEVNSTLWDRESSHDECQDALN 
ILPVPGPTSSATNVSMWSAGPLSGEKAAINILEINEKLRPQLA 
EKKQQFRNLKEKCFLTQLACFLANQQNKYKYEECKDLIKFMLRN 
ERQFKEEKLAEQLKQAEELRQYKVLVHSQERELTQLREKLREGR 
DASRSLNEHLQALLTPDEPDKSQGQDLQEQIAEGCRIiAQHLVQK 
LSPENDNDDDBDVQVEVAEKVQKSSAPREMPKAEEKEVPEDSLE 
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SEQ 
ID 
WO: 


1 Predicted 
beginning 
nucleotide 
location 
cor re spond ing 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicteci end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acxd segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, P^Phenylalanine. G^Glycine, 
H=Histidine, I=Isoleucine^ K=Lysine* 
L=Leucine, M=Methionine, N»Asparagine, 
P^Proline, Q=.Glut amine , R=^rginine, 
SsSerine, T=Threonine, V=Valine, 
W=rryptophan, Y«Tyro3ine, X«Unknown, *=Stop 
Codon, /impossible nucleotide deletion, 
\=possible nucleotide insertion) 








ECAITCSN5HGPYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEW ' 

EDAVHIIPENESDDEEEEEKGPVSPRNLQESEEEEVPQESWDEG 

YSTLSIPPEMLASYKSYSSTFHSLEEQQVCMAVDIGRHRWDQVK 

KEDHEATGPRLSRELLDEKGPEVLQDSLDHCYSTPSGCLELTDS 

CQPYRSAFYVLEQQRVGLAVNMDEIBKYQEVEEDQDPSCPRLSR 

ELLDEKEPEVLQDSLGRCYSTPSGYIjELPDLGQPYSSAVYSLEE 

QYLGLALDVDRIKKDQEEEEDQGPPCPRLSRELLEWEPEVLQD 

SLDRCYSTPSSCIiEQPDSCQPYGSSFyALBEKHVGFSLDVGEIE 

KKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKG 

PEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGLAV 

DMDEIEKYQEVEEDQDPSCPRLSGELLDEKEPEVLQESLDRCYS 

TPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMDEIEKYQEVEE 

DQDPSCPRLSRELLDEXEPEVLQDSIiGRCYSTPSGYLELPDLGQ 

PYSSAVYSLEEQYLQIiALDVDRIKKDQBEEEDQGPPCPRLSREL ' 

LEWEPBVLQDSLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKH 

VGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCP 

RIiNSMIjMEVEEPEVI.QDSLD ICYS TPSMYFELPDS FQHYRS VFY 

SFEEEHISPALYVDNRFPTIiTVTSLHLVFQKGVIFPQ 


6349 


3 

• 


3679 


AGAEKCFVTLLACFLAKQQNKYKyEECKDLIK3^!ILRNELQPKEB 

KLAEQLKQABELRQYKVLVHSQERELTQLREKLREGRDASaSLN 

EIILQALLTPDBPDKSQGQDLQEQLAEGCRLAQHLVOKLSPENDN 

DDDEDVQVEVAEKVQKSSSPREMQKAEEKEVPBDSLKECAITCS 

NSHGPCDSNQPHKNIKITFBEDEVNSTLWDRESSHDECQDALN 

ILPVPGPTSSATNVSMWSAGPLSGEKAAINILEINEPCLRPQIA 

EKKQQFRNLKEKCPLTOIACFLANQQNKYKYEECKDLIKFMLRN 

ERQFKEEKLAEQLKQAEELRQYKVLVHSQBRELTQLRBKLREGR 

DASRSLNEHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHIiVQK 

LSPENDNDDDEDVQVEVAEKVQKSSAPREMPKAEEKEVPEDSIiE 

ECAITCSNSHGPYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEW 

EDAVHIIPENESDDEEEEEKGPVSPRNLQESEEEEVPQESWDEG 

YSTLSIPPEMLASYKSYSSTFHSLEEQQVCMAVDIGRHRWDQVK 

KEDHEATGPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDS 

CQPYRSAPYVLEQQRVGLAVNMDEIEKYQEVEEDQDPSCPRLSR 

ELTiDEKEPEVLQDSLGRCYSTPSGYLEIiPDLGQPYSSAVYSLEE 

QYLGLALDVDR I KKDQEEEEDQGPPCPRi:.SREI)IiEVVEPE VLQD 

SLDRCYSTPSSCLEQPDSCQPY6SSFYALBEKHVGFSLDVGEIE 

KKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRI/SRELLDEKG 

PBVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYIIiEQQRVGIAV 

DMDBIEKYQBVEEDQDPSCPRLSGELXiDEKEPEVLQESLDRCYS 

TPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMDEIEKYQEVEE 

DQDPSCPRLSRELLDEKEPEVLQDSLGRCYSTPSGYLELPDLGQ 

PYSSAVYSLESQYLGIALDVDRIKKDQEEEEDQGPPCPRLSREL 

LEWEPEVLQDSUDRCYSTPSSCDEQPDSCQPYGSSFYALEEKH 

VGFSLDVGEIEKKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCP 

RLNSMLMEVEEPEVLQDSLDICYSTPSMYFELPDSFQHYRSVFY 

SFEBEHI SFALYVDNRFFTLTVTSLHLVFQMGVI FPO 


6350 


3 


3679 


AGAEKCFVTLLACPUUCQQNKYKYEECKDLI KSMLRNELQFKEE 
KLAEQLKQAEELRQYKVLVHSQERELTQLREKLRBGRDASRSLN 
EHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHIiVQKLSPENDW 
DDDEDVQVEVAEKVQKS SS PREMQKAEEKEVPEDSLEECAITCS 
NSHGPCDSNQPHKNIKITFEEDEVNSTLWDRESSHDECQDAUT 
ILPVPGPTSSATNVSMWSAGPLSGEKAAIHILEINEKLRPQLA 
EKKQQFRKLKEKCFIiTQLACFIiANQQNKYKYEECKDLIKPMLRN 
BRQFKEEKLAEQLKQAEELRQYKVX»VHSQERBLTQLREKLREGR 
DASRSLNEHLQALLTPDEPDKSQGQDLQEQLAEGCRLAQHLVQK 
LS PENDNDDDEDVQVBVAEKVQKS SAPREMPKAEEKEVPBDSLE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
bo first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=:Alanine, C-Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G=Glycine 
H»Histidine. I=Isoleucine, K=Lysine, 
L= Leucine, M=Methionine, N=Asparagine , 
P^Proline, Q==Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, y=Tyro3ins, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=pDssible nucleotide insertion) 


6351 






iSCAITCSNSHGPYDSNQPHRKTKITFEEDKVDSTLIGSSSHVEW 

EDAVHIIPENESDDEEEEBKGPVSPRNLQESEEBEVPQESWDEG 

YSTLSIPPEMLASYKSySSTFHSLEEQQVCMAVDIGRHRWDQVK 

KEDHEATGPRLSRELLDEKGPEVLQDSLDRCYSTPSGCLELTDS 

CQPYRSAPYVLSQQRVGLAVHMDEIEKYQEVEEDQDPSCPRLSR 

ELLDEKEPEVLQDSLGRCYSTPSQYLELPDLGQPYSSAVYSLEE 

QYLGLArj)VDRIKKDQEEEEDQGPPCPRLSRELLEWEPEVLQD 

SLDRCYSTPSSCLEQPDSCQPYGSSFYALEEKHVGFSLDVGEIE 

KKGKGKKRRGRRSKKERRRGRKEGEEDQNPPCPRLSRELLDEKG 

PEVLQDSLDRCYSTPSGCLELTDSCQPYRSAFYILEQQRVGLAV 

DMDEIBKYQEVEEDQDPSCPRLSGELLDEKEPEVLQESLDRCYS 

TPSGCLELTDSCQPYRSAFYILEQQRVGLAVDMDEIEKYQEVEE 

DQDPSCPRLSRELLDEKEPEVLQDSIiGRCYSTPSGYLEIiPDLGQ 

PYSSAVYSLEEQYLGLALDVDRIKKDQEEEEDQGPPCPRLSREL 

LEWBPEVLQDSLDRCySTPSSCI.EQPDSCQPyGSSFYALEEKH 

VGFSLDVGEIEKKGKGKKRRGRRSKKBRRRGRKBGEEDQNPPCP 

RLNSMLMBVEEPEVIiQDSLDICYSTPSMYFELPDSFQHYRSVFY 

SFEEEHISFALYVDNRFFTLTVTSLHLVFQMGVIFPQ 


6352 


1291 


319 


KEARRRTERSQLGRMLWEVANGRSLVWGAEAVQALRERLGVGG 
RTVGALPRGPRONSRLGLPriLLMPEETUiLIiAEIGAVTLVSAPRP 
DSRHHSLALTSPKRQQEESPQEQSAIAAEARETRRQELLEKITE 
GQAAKKQKLEQASGASSSQEAGSSQAAKEDETSDGQASGEQKEA 
GPSSSQAGPSNGVAPLPRSALLVOLATARPRPVKARPLDWVQS 
KDWPHAGRPAHELRYSIYRDLWBRGFFLSAAGKPGGDFLVypGD 
PLRFHTVHYIAQCWAPBDTIPLODLVAAGRTjG'PSVRPfTT t t neno 
PDGKWYTSLQWASLQ 


6353 


235 


923 


WSEWLSPCHAAKCKGLSMLRITMKTRAISLAADATEFVQGRSAP 

AMARSLVHDTVFYCLSVYQVKISPTPQLGAASSAEGHVGQGAPG 

LMGNMNPEGGVNHENGMNRDGGMIPEGGGGNQEPRQQPQPPPEE 

PAQAAMEGPQPENMQPRTRRTKFTLLQVEELESVFRHTQYPDVP 

TRRELAENLGVTEDKVRVWFKNKRARCRRHQRELMLANELRADP 
DDCVYIWD 


6354 


65 


672 


KfACiAGAIPKARARPPDVQAAEEEKEMDLPDSASRVFCGRlLSM 
VNTDDWAIILAQraJMLDRFSKTNEMLLNFNNLSSARLQQMSER 
FLHHTRTLVEMKRDLDSIPRRlRTLKGKLftRQHPEAFSHIPEAS 
FLBBEDEDPIPPSTTTTIATSEQSTGSCDTSPDTVSPSriSPGFE 
DLSHVQPGSPAINGRSQTDDEEMTGE 


— eTsS" " 


965 


510 


i'aiiilPMEPTRutJi'L.bGGAFSAjCtPMGArDVSDLRPVPlSNrQEVPC 

HPVTDQSLIVELLELQAHVRGEAAARYHFEDVfSGVQGARAVHVE 

SVQPLSLENLALRGRCQBAWVLSGKQQIAKENQQVAKDVTLHQA 
LLRLPQYQTDLLLTFNQPP 


6356 


158 


1662 


KesSAAFRGSQI»RGA^3IRRVLPHGMGRGLLTRRPGTRRGGFSLD 

WDGKVSElKKKIKSILPGRSCDLIiQDTSHLPPBHSDVVIVGGGV 

liGLSVAYWLKKLESRRGAlRVLWERDHTYSQASTGLSVGGICQ 
QFSLPENIOLSI*PfiAc;cT,RWT*NrcvT Ji^nmAnoT i^t nnvmrt^.,. - 

LASEKDAAAMESNVKVQRQEGAKVSLMSPDQLRNKFPWINTEGV 
ALASYGMEDEGWFDPWCLLQGLRRKVQSLGVLFCQGEVTRFVSS 
SQRMLTTDDKAWLKRIHBVHVKMDRSLBYQPVECAIVINAAGA 
WSAQIAAIAGVGEGPPGTLQGTKLPVEPRKRYVYVWHCPQGPGL 
ETPLVADTSGAYFRREGLGSNYIiGGRSPTEQEEPDPANLEVDHD 
FFQDKVWPHIiAIJlVPAFETLKVQSAWAGYYDYNTFDQWGWGPH 
PLWNMYFATGFSGHGLQQAPGIGRAVAEMVLKGRFQTIDIiSPP 
[^FTRFYLGEKIQENNII 




354 


633 

i! 


rGLTSSCLPLQVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREV 

\DSYAQNAKVIEKQLERKGMSKRRLQBLABIiEAKKAKMKGTLID 
JQPK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide - 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment: containing signal peptide " 
(A^Alanine, C=Cysteine, D=Aepartic Acid, E=: 
Glutamic Acid, F= Phenylalanine, Gx^Glycine, 
H^Histidine, I«Isoleucine, K=Lysine, 
li=lieucine , M=Methionine , N=:Asparagine , 
P=Proline, Q=Glutamine. R=Arginine, 
S=Serine, T=Threonine, V=VaIine, 
W=Tryptophan, Y=TyrQsine, X^Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6357 


2 


915 


GI,LRNMWJi.V«VLRNQTSISQWVPVCSRLIPVSPTQGQGDRALS 
RTSQWPQMSQSQACGGSEQIPGIDIQLNRECYHTTRKLSTTKDSP 
QP VEEKVGAFTKI lEAMGFTGPLKYSKWKIKIAALRMYTS CVEK 
TDFEEFFLRCQMPDTFNSWFLITLLHVWMCLVRMKQEGRSGKYM 
CRIIVHFMWEDVQQRGRVMGVKPY-ILKKNMILMTNHFYAAILGY 
DEGILSDDHGIaAAALWRTPFMRKCEDPRHr.ELLVEYVRKQIQyL 
DSKNGEDLLLTGEVSWRPLVEKNPQSILKPHSPTYKDEGL 


6358 


2009 


1040 


ASDALHSLSAPVLRLSSRSAARPATMTEQAISFAKDFLAGGIAA 
AISKTAVAPIERVKLLLQVQHASKOIAADKQYKGIVDCIVRIPK 
ECKSVliSFWRGNUVNVIRYFPTQALNFAFKDKYKQlFI^GVDKHT 
QFWRYFAGNIiASGGAAGATSIiCFVYPLDPARTRLAADVGKSGTE 
REFRGLGDCLVKITKSDGI RGLYQGFSVSVQGI r I YRAAYFGVY 
DTAKGMLPDPKNTHIWSWMIAQTVTAVAGWSYPFDTVRRRMM 
MQSGRKGADIMYTGTVDCWRKIFRDEGGKAFFKGAWSNVLRQMG 
GAFVLVX.YDELKECVI 


6359 
6360 


98 


1086 


VCRQEEEKMKEDCLPSSHVPISDSKSIQKSELLGLLKTYNCYHE 
GKSFQLRHREEEGTLIIEGLLNIAWGLRRPIRLQMQDDREQVHL 
PS TS WMPRRPSCPLKSPS PQNGNITAQGPS IQPVHK7VESSTDSS 
GPLEEAEEAPQLMRTKSDASCMSQRRPKCRAPGEAQRIRRHRPS 
INGHFYNHKTSVFTPAYGSVTNVRVNSTMTTLQVLTLLLNKPRV 
EDGPSEFALYIVHESGERTKLKDCEYPLISRILHQPCEKIARIF 
LMBADLGVEVPHEVaQYIKFEMPVLDSFVEKLKEEEERBIIKLT 
MKFQALRLTMLQRLEQLVEAK 




1 


345 


GTRGAVPSTiEEWIiPPRSCRVFWIHSGTTMSKVSFKITLTSDP 
RLPYKVLSVPESTPFTAVLKFAAEEFKVPAATSAIITNDGIGIN 
PAQTAGNVFLKHGSELRI IPRDRVGSC 


6351 


615 


158 


RPGLGQLQHCAIAPQAGNRRCRFHGRLHALTRSTHRGKPMSIMQ" 

FKDTLNTPLPDSSPVAVPLGAPIAVASTLSVEHNDGVETGIWAC 

APGRWRRQITSQEFCHFIQGRCTFTPDDGETLHIQAGnALMLPA 
NSTG IWDIQETVRKTYVLIL 


6362 


350 


1576 


TTMDGSHSAAI^KLQQLPPTSSSSAVSEASFSyKENLIGALLAIF 
GHLWSIALKLQKYCHIRLAGSKDPRAYFKTKTWWLGLFLMLLG 
ELGVFASYAFAPLSLIVPLSAVSVIASAIIGIIFIKEKWKPKDF 
LRRYVLSFVGCGLAWGTYLLVTFAPNSHEKMTGENVTRHLVSW 

ppllymlveiilfclllyfy:<eknanniwilllvai^smtvv' 

TVKAVAGMLVI^ IQGNLQLDYPIFYVMFVCMVATAVYQAAPLSQ 

asqmydssliasvgyilsttiaitagaifyldfigedvlhicmf 

ALGCLIAFLGVFLITRNRKK?IPPEPYISMDAt4PGMQNMHDKGM 

tvqpelkasfsygalenndniseiyapatlfvmqeehgsrsasg 
vpyrvlehtkke 


6363 


21 


1201 


RRTR]^SSFFRRRDSSAMESYDVIANQP\A/IDNGSGVIKAGFAG~ 
DQIPKYCFPNYVGRPKHVRVMAGALEGDIFIGPKAEBHRGLLS X 
RYPMBHGrVKDWNDMERIWQYVYSKDQLQTFSEEHPVnDTEAPL 
NPRKNRERAAEVFFBTFNVPALPISMOAVLSLYATGRTTnwT.n 

SGDGVTHAVPIYEGFAMPHSIMRIDIAGRDVSRFLRIiYLRfCEGY 
DFHSSSEPEIVKAIKERACYLSINPQKDETLETEKAQYYDPDGS 
TIEIGPSRPRAPELLPRPDLIG&IKSEGrHEVLVFAIQKSDMDLR 
RTLFSNIVLSGGSTLFKGFGDRLLSEVKKLAPKDVKIRISAPQE 
RliYSTWIGGSlLASLDTFKKMWVSKKEYBBDGARSIHRKTP 




21 


1201 

: 


RRTRIiGSS PPRRRDS3AMES YDVIANQPWTDNOSGVI KKgFAG 
DQIPKYCFPNYVGRPKHVRVMAGALEGDIFIGPKAEEHRGLLSI 

rypmehgivecdwndmeriwqyvyskdqlqtfseehpvllteapl 
mprknreraabvffetfmvpalfismqavlslyatgrttgwld 
sgdgvtkavpiyegfamphsimridiagrdvsrflrlyiirkegy 
dfhsssefeivkaikeracylsinpqkdetletekaqyylpdgs 

PIEIGPSRFRAPEriFRPDLXGEESEGIHEVLVFAXQKSDMDLR 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(Aa Alanine, C=Cysteine, D=Aspartic Acid^ E= 
Glutamic Acid, F=^ Phenyl alanine, G=Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L=Ijeucine, M=Methionine, N=Asparagine, 
PisProline, Q^lutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
Ms=Tryptophan, Y=Tyroaine, X'sUnknown, *-.Stop 
Codon, /-possible nucleotide deletion, 
\-possible nucleotide insertion) 








RTLFSNIVLSGGSTLPKGFGDRLLSEVKKLAPICDVKIRISAPQE 
RLYSTWIGGSILASLDTFKKMWVSKKEYEEDGARSIHRKTF 


6355 


234 


1939 


KHKSRASCAARAQAFGPSREREVHSRFR2GLRRL6ESNSGCCTM 
ASMGTLAFDE YGRPFL 1 1 KDQDRKSRLMGLEALKSHIKAAKAVA 
NTMRTS LGPNGLDKKMVDKDGDVTVTNDGATILSMMDVDHQ I AK 
LMVELS KSQDDE I GDGTTGVWLAGALLEBAEQLLDRG IHPIRI 
ADQ YEQAARVAIEHLDKI SDSVLVDI KDTEPI. I QTAKXTLGSKV 
VNS CHRQMAE lAVNAVLTVADMERRDVDFEL I KVEGKVGGRLBD 
TKL I KG VI VDKDFSHPQMPKKVEDAKI AILTC PFEPPKPKTKHK 
LDVTS VEDY KALQKYFKKKFEEMIQQ I KETGANLAICQWGFDDE 
ANHLLLQNNLPAVRMVGGPSIELIAIATGGRIVPRFSELTAEKL 
GFAGLVQE IS FGTTKDKMLVIEQCKNSRAVTI FIRGGNKMI I EE 
AKRSLHDALCVIRNLIRDNRWYGGGAAEISCALAVSQEADKCP 
TLEQYAMRAFADALEVI PMALSENSGMNPIQTMTE VRARQVKEM 
NPALGIDCrJiKGTNDMKQQHVIETI»IGKKQQISLATQMVRMILK 
IDDIRKPGESEE 


6366 


257 


1898 


GNKEGAHSSTFW VLLS I FLGAVAMLCKEQGITVLGLNAVPDILV 
IGKFNVLEIVQKVLHKDKSLENLGMLRNGGLLFRMTLLTSGGAG 
MLYVRWRIMGTGPPAFTEVDNPAS FADS^^LVRAVNYNYY YSLNA 
WLLLCPWWLCFDWSMGCIPLIKSISDWRVIALAALWFCLIGLIC 
QALCSBDGHKRRILTLGLGFLVIPFIjPASNLFFRVOFWAERVXi 
YLPSVGYCVLLTFGFQAIjSKHTKKKKLIAAWIiGILFINTIiRCV 
LRSGEWRSEEQLFRSALSVCPLNAKVHYWIGKNLADKGNQTAAI 
RyYREAVRtiNPKYVHAMNNLGNILKERNEL3EAEELLS LAVQIQ 
PDFAAAWMNLGIVQNSLKRFEAAEQSyRTAIKHRRKYPE)CYYNL 
GRLYADLNRHVDALNAWRNATVLKPEHSIAWNNMIILLDNTGKL 
AQAEAVGREALELIPNDHSLMFSLANVLGKSQKYKESEALFLKA 
IKANPNAASYHGNIAVLYHRWGHLDLAKKHYEISLQLDPTASGT 
KEN YGL LRRKLBLMQECKAV 


6367 


287 


1934 


SIGFPVMLVLSILLYTCEMFQDSVAFEDVAVSFTQEEWALLDPS "" 

QKNLYRDVMQBTFKNLTSVGKTWKVQWIEDEYKNPRRNLSLMRE 

KLCES KESHHCGESFNQIADDMLNRKTLPG ITPCESSVCGEVGT 

GHSSLNTHIRADTGHKSSEYQEYGENPYRNKECKKAFSYIiDSPQ 

SHDKACTKEKPYDGKBCTETFISHSCIQRHRVMHSGDGPYKCKF 

CGKAFYFLNLCLIHERIHTGVKPYKCKQCGKAFTRSTTLPVHER 

THTGVNADECKECGMAFSFPSE IRRHKRSHTGEKP YECKQCGKV 

PISFSSIQYHKMTHTGEKPYBCKQCGKAFRCGSHLQKHGRTHTG 

EKPYECRQCGKAFRCTSDLQRHEKTKTEDKPYGCKQCGKGFRCA 

SQLQIHERTHSGEKPHECKECGKVFKYFSSIiRIHERTHTGEKPH 

ECKQCGKAFRYFSSLHIHERTHTGDKPYECKVCGKAFTCSSSIR 

YHERTHTGEECPYECKHCGKAFISNYIRYHERTHTGEKPYQCKQC 

GKAFIRASSCE^HERTHTIHR 


6368 


1 


327 


RPVPAKLNPRSWPRTAGALPLRP PPLTMAVFHDEVE I EDFQYDE 
DSETYFYPCPOGDNFSITKEDLENGEDVATCPSCSLIIKVIYDK 
DQFVCQETVPAPSANKELVKC 


636S> 


1 


1745 


AGCCRDTRFPTPRGPGS LCHNFCRSAACTVTRTIHGS PREI>TGt 
PRSREMMFQDSVAFEDVAVSFTQEEWALLDPSQKNLYRDVMQET 
FKNLTS VGKTWKVQNI EDEYKNPRRNLS LMREKLCES KESHHCG 
ESPNQIADDMLNRKTLPGITPCESSVCGEVGTGHSSLNTHIRAD 
TGHKSSEYQEYGBNPYRNKECKKAFSYLDSFQSHDKAC7KSKPY 
DGKSCTETPrSHSCIQRHRVMHSGDGPYKCKFCGKAFYFLNLCL 
IHERIHTGVKPYKCKQOGKAFTRSTTIiPVHBRTHTGVNADECKE 
CGNAFSFPSEIRRHKRSHTGEKPYE CKQCGKVPISFSS IQYHKM 
THTGEKPYECKQOSKAFRCGSHLQKHGRTHTGEKPYECRQCGKA 
FRCTSDLQRHEKTHTEDKPYGCKQCGKGFRCASQLQIHERTHSG 
EKPHECKECGKVFKYFSSLRIHERTHTGEKPHECKQOGKAFRYF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first: 
amino acid 
residue of 
amino acid 
sec[uence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=rCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=:Histidine, I=Isoleucine, K=Lysine, 
L=iLeucine, M=Methionine, N^sparagine^ 
P=Proline, Q«Glutamine, R=^ginine, 
S=Serine, TaThreonine, V^Valine, 
W=Tryptophan, Y=*Tyrosine, X=Uiiknown, *=»Stop 
Codon, /=possible nucleotide deletion, 
\-possible nucleotide insertion) 








SSLHlHBRTHTGDKPYECKVCGKAFTCSSSlfeYHERTHTGEKPy " 

ECKHCX3KAPrs»yiRYHERTHTGEKPYQCKQCGKAFIRASSCRE 
HERTHTINR 


6370 


1711 


329 


FVLSEQRI.RTERTWPRSPGLGRGAAAAGARTAGAGLLRI.LLGCG ■ " 
ALVGGLRPVTMTTPANAQNASKTWELSLYELHRTPQEArMDGTB 
lAVSPRSLHSELKCPICLDKLKNTMTTKECLHRFCSDCrVTALR 
SGNKBCPTCRKKLVSKRSrj^PDPNFDALISKIYPSRBEYEAHQD 
RVX>IRljSRLHNCX3ALSSSIEEGLRMQAMHRAQRVaRPIPGSDQT 
TTMSGGBGEPGEGEGDGEDVSSDSAPDSAPGPAPKRPRGGGAGG 
SSVGTGGGGTGGVGGGA6SEDSGDRGGTLGGGTLGPPS P PGAPS 
PPBPGGEIELVFRPHPLLVEKGEYCQTRYVKTTGNATVDHLSKY 
! LALRIALERRQQQEAGEPGGPGGGASDTGGPDGCGGEGGGAGGG 
DGPESPALPSLEGVSEKQYTIYIAPGGGAFTTLNGSLTLELVNE 
KFWKVSRPLELCYAPTKDPK 


6371 


3 


288 


GVANMSTAMNFGTKSFQPRPPDKGSFPLDKLGECKSFKEKJMkt" 

LHKNNFENALCRKESKEYLECRMERKLMLQEPLEKLGFGDLTSG 

KSEAXK 


53 72 


2141 


625 


RVSArASEGKAEERYKKLEDLLEKSFSbVKMPSLQPWMCVMKH " 

LPKVPEKKLKLVMADKELYRACAVEVRRQIWQDNQALFGDEVSP 

LLKQYILEKESALFSTEIiSVLHNFFSPSPKTRRQGBWQRLTRM 

VGKNVKLyDMVliQFLRTLFr*RTRNVHYCTLRAELLMSLHDLD\7G 

EICTVDPCHKFTWCLDACIRERFVDSKRARELQGFLDGVKKGQE 

QVLGDLSMILCDPPAlNTLALSTVRHIiQELVGQETLPRDSPDLL 

LLLRLLALGQGAWDMIDSQVFKEPKMEVELI TRFLPMLMS FLVD 

DYT FUVDQKLPAEEKAPVS YPNTLPES FTKFLQEQRMACEVGLY 

YVLHITKORNKNALLRLIiPGLVETFODLAFGDI FLHLLTGWLAL 

LADEFAi:,EDFCSSbFIX3PFI,TASPRKENVHRHAIjRr,riIHLHPRV 

APSKLEALQKALEPTGQSGEAVKELYSQLGEKLEQLDHRKPSPA 

QAAETPALBLPLPSVPAPAPL 


6373 


67 


711 


PSRAARASPARLPAMVSWirSRLWLIFGTLYPAYY^YKAVKSK 
DIKEYVKWMMYWXIFALFTTABTFTDIFLCWPPFYYELKIAPVA 
WLLS P YTKGSSLLYRKFVHPTLSSKE KE IDDCLVQAKDRS YDAL 
VHFGKRGLNVAATAAVMAASKGQGALSERLRSFSMQDLTTIRGD 
GflPAPSGPPPPGSGRASGKHGQPKMSRSASESASSSGTA 


S374 


535 


210S 


HKLFCSYISTSEFPSSTRHHSCPTHTFCNYTSSTIFLSSTRDHS 
CPTHTFCETYTSSTrFLSSTRDHSCPTHTSCNYTSSTIFLSSTRD 
HSCPTHTSCNYTSSTIFLSSTRDHSCPTHTFCNYPRPIIRLSSC 
CPAELQTBGSNGKKEVLSGPQWLEDTVLPPEGGGQPDDRGTIN 
DISVLRVTRRGBQADHFTQTPLDPGSQVLVRVDWERRFDHMQQH 
SGQHLirAVADHLFiCliKTTSWELGRFRSAIEIjDTPSMTAEQVAA 
lEQSVNEKIRDRLPVNVRELSLDDPEVEQVSGRGLPDDHAGPIR 
WNIBGVDSNMCCGTBiVSNLSDLQVIKILGTEKGKKNRTNLIFL 
SGNRVliKWMERSHGTEKALTALLKCGAEDHVEAVKKI,QNSTKIL 
QKNNLNLIjRDLAVHI AHSLRWSPDWGGWILHRKEGDSEFMN 1 1 

aneigse3tllfltvgdekggglpliagppasvbtlgprvaevi, 
egkgagkkgrpqgkatkmsrrmeaqallqdyistqsaks 


6375 


1 


1535 


aimaaatrpvrlpeagcegrercwnpsrsrshsgeogiiaawsrt 
cpgrprrpgqqwrgptmlvtaylafvgllasclglelsrcrak 

PPGRACSNPSFLRFQIiDFYQVYFLALAADWLQAPYXiYKLYQHYY 
FLEGQIAILYVCGLASTVLFGLVASSLVDWLGRKNSCVIiFSLTY 
SLCCLTKLSQDYFVLLVGRALGGLSTALLPSAPEAWYIHEHVER 
HDPPAEWr PATFARAAPMNHVLAWAGVAAEAVAS WIGLGPVAP 
FVAAI PliLAIiAGAIiALRNWGENYDRQRAFSRTCAGGLRCLLSDR 
RVtiLUSTIQTVLFES VI PI FVPLWTP VLDPHGAPLGII FSSFMAA 
SLUSSSLYRIAXSKRYHIiQPMHLLSIAVLIWFSLFMIiTFSTSP 
GQESPVBSFtAPI,IilEIACGLyFPSMSFLRRKVIPETEQAGVLN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
bo first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C-Cysteine, D=:Aspartic Acid, 
Glutamic Acid, F= Phenylalanine, G^Glycine, 
H=Histidine, I=Iaoleucine, K=Lysine, 
L^Leucine, M=Methiontne, N^Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S-Serine, T= Threonine, V«Valine, 
W=Tryptophan, Y=Tyrosine, X-Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\=posBible nucleotide insertion) 


6376 






WFRVPLHSLACIXSLLVIJiDSDRKTGTRKMFSICSAVWVMfl^ 
VGLFTWRHDAELRVPSPTEEPYAPEL 


6377 " 


380 


1437 


ISSTDIDHYRFSFLVNSKMPSKESWSGRKTNRAAVHKSKQEGRQ 
QDLLIAALGMKLGSPKSSVTIWQPLKLFAYSQLTSLVRRATLKE 
NEQIPKYEKIHNPKVHTFRGPHWCEYCANFMWGLIAQGVKCADC 
GLm^QCSKMVPNDCKPDLKH\nCKVYSCDLTTLVKAHTTKRPM 
WDMCIREIESRGLNSEGLYRVSGFSDLIEDVKMAFDRDGEKAD 
ISVNMYEDINIITGALKLYFRDLPXPLITYDAYPKFIESAKIMD 
PDEQLETLHEALKLLPPAHCETLRYLMAHLKRVTLHEKENLMNA 
ENI.GI VFGPTLMRS PELDAMAALNDIRYQRLWELLI KNEDILF 


6378 


2311 


1845 


SRIRRRSSRRPREPPGPSRRRRRRRPDPRTMPSEKTPKQRRTFE 
QRVEDVRLIREQHPTKIPVIIERYKGEKQLPVLDKTKFLVPDHV 
NMSELIKirRRRbQIiNANQAFFLLVNGHSMVSVSTPlSEVYESE 
KDEDGPLYMVYASQBTFGMKLSV 




686 


191 


GAGPWEAFPDGIGRRSRRARLPQYKRPPGRVGGGDSGRRJJMAVA 
DLAblPDVDIDSDGVFKYVLIRVHSAPRSGAPAAESKEIVRGYK 
WAEYHADIYDKVSGDMQKQGCDCECLGGGRISHQSQDKKIHVYG 
YSMAYGPAQHAISTEKIKAKYPDYEVTWANIX5Y 


6379 


35 


378 


BRAGSPSPtiKAALRRCAPQRSQAPRWPDRAACRRSFQGSQGRAY 
LFNSWNVGCGPAEERVLLTGLHAVADIYCENCFCTTLGWKYEHA 
FESSQKYKEGKYIIELAHMIKDlfGWD 


S380 
6381 


1414 


462 


pavqgqrgagpptgrgsgwmarfAltwrhgetrfnkekiiqgq"" 

GVDEPLSETGFKQAAAAGIFLNKVKPIHAFSSDLMRTKQTMHGI 

lerskfckdmtvkydsrlrerkygwegkalselramakaareb 

CPVFTPPGGETLDQVKMRGIDPFEPLCQLILKEADQKEQPSQGS 
PSNCLETSIAEIPPLGKNHSSKVNSDSGIPGIiAASVLWSHGAY 
MRSLFDYFLTDLKCSLPATLSRSELMSVTPNTGMSLFIINFEEG 
REVKPTVQCI CMNLQDHLNGIiTENS LGIiNI.P*: IfqwHPT? dt vnxin 
lALFTSLLC 


6382 


1668 


218 


AVVRAQGSRGFSGAGWRPRQAAAMNFSEVFKLSSLLCKPSPDGK 

YLASCVQyRLWRDVWTLQrLQLYTCLDQIQHIEWSADSLFILC 

AMYKRGLVQVWSLEQPEWHCKIDEGSAGLVT^CWSPDGRHILNT 

rEPHLRITVWSLCrKSVSYIKYPKACLQGITFTRDGRYMALAER * 

RDCKDYVS IFVCSDWQLLRHPDTDTQDLTGIEWAPNGCVLAVWD 

TCLEYKILLYSLDGRLIiSTYSAYEWSLGIKSVAWSPSSQFIAVG 

SYDGKVRILNHVTWKMITEFGHPAAINDPKIWYKEAEKSPQLG 

IiGCIiSFPPPRAGAGPLPSSESKYEIASVPVSLQTLKPVTDRANP 

KIGIGMIiAFSPDSYFLATRNDNtPNAVWVWDIQKLRLPAVLEQL 

SPVRAPQWDPQQPRLAICTGGSRLYLWSPAGCMSVQVPGEGDFA 

VLSLCWHLSGDSMALLSKDHPCLCFLETEAWGTACRQLGGHT 


6383 


2 


1062 


FKHDEDRNLCLIAYPbKGDHGIVDIVDNSDCEPKSKLLRWTTNK 
KHHVLETBKTPKDWVRQHRKEEKMKSHKLEEEFEWIiKKSBVLYY 
TVEKKGNISSQLKHYNPWSMKCHQQQLQRMKENAKHRNQYKFIL 
LENLTSRYEVPCVLDLKMGTRQHGDDASEBKAANQIRKCQQSTS 
V K. V wurwy V I yA^abLFUiiMt WNKYHGRKLSVQGFKEALPQPF 

hngrylrrellgpvlkkltelkavlerqesyrfysssllviyug 

KERPEWLDSDAEDLEDLSEESADESAGAYAYKPIGASSVDVRM 

IDFAHTTCRLYGEDIWHEGQDAGYIFGLQSLIDIVTErSEESG 
E 




3159 


1061 

< 

C 

1 


SPAPGRPSPKGSQPAARAAAAPAMPSAKQRGSKGGHGAASPSEK 
3AHPSAARPLAAPTPAAPACRS PS PGGAPAS FPGRAPRSLAS QP 
^ARAAAAPAMPSAKQRGSKGGHGAASPSEKGAHPSGGADDVAKK 
PPPAPQQPPPPPAPHPQQHPQQHPQNQAHGKGGHRGGGGGGGKS 
SSSSSASAAAAAAAASSSASCSRRLGRALNFLFYLALVAAAAFS 
3WCVHHVLEEVQQVRRSHQDFSRQREELGQGLQGVBQKVQSLQA 
PFGTFESILRSSQHKQDLTEKAVKQGESEVSRISEVLQKLQNEI 
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SEQ 
ID 
NO: 


Predicted 
becfinning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predxcted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
{A?=Alanine, C^Cysteine, D=A3partic Acid, E= 
Glutamic Acid, F«Phenylalanine, G=Glycine, 
H=ttistidine, I=Isoleucine, K=Lysine, 
L=Iieucine, M=Methionine, N^Asparagine, 
P=:Proline, Q=Glutaraine, R^Arginine, 
S=Serine, T=Threonine, V^Valine, 
W^Tryptophan, Y^Tyrosine, X-Unknown, *=3top 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








LKDLSDGIHWKDARSRDFTSLENTVEERLTELTKS INDNIAI F 
TEVQKRSQKEINDMKAKVASLESSEGNKQDLKALKEAVKEIQTS 
AKSREWDMEALRSTLQrMESDirrEVRELVSLKQEQOAFKEAAD 
TERLALQALTEKLLRSEESVSRLPEEIRRLEEELRQLKSDSHGP 
KEDGGPRHSEAFEALQQKSQGLDSRLQHVEDGVLSMQVASARQT 

GETQLVLYGDVESIiKRSVGELPSTVESLQKVQBQVHTLLSQDQA 
QAARrjPPQDPLDRIiSSLDNIiKASVSQVBTUDLKMLRTAVDSLVAY 
SVKIETKENNLESAKGIiLDDLRNDLDRLFVKVEKIHEKV 


6384 


73 8 


1904 


IWEVPVCLTHLLHLQQANQPLPPPSSSINEEDADEANRAIGEKR 
AAPDSGKKPKTPKTKQQKDPNEPQKPVSAYALPFRDTQAAIKGQ 
NPNATFGEVSQIVASMWDSLGEEQKQVYKRKTEAAKKEYLBCALA 
AYRASriVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNTPLSQ 
HGTVSASPQTLQQSLPRSIAPKPLTMRLPMWQIVTSVTIAANWP 
SNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQ 
QQQMQQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQH 
LQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI 


6385 


2 


1584 


PRVR7WU3VAAGAQAWSAGMAKSNGENGPRAPAAGESLSGTRES 
lAQGPDAATTDSLSSLGSDSEANGPAERRIDKFGFIVGSQGAEG 
ALEEVPLBVLRQRESKWI,DMLNWWDKWMAKKHKKIRLRCQK(4IP 
P5LRGRAWQ YLSGGKVKLQQNPGKFDELDMSPGDP KWLDVI ERD 
IjHRQFPFHEMFVSRGGHGOOnt.PTJVT.If aVTT v'Di5n'T3>r«vr«nxrtT\ n 

lAAVIiLMHMPAEQAFWCLVQICEKYLPGYVSEKLEAIQLDGEIL 
FSLLQKVSPVAHKHLSRQKIDPLLYMTEWPMCAFSRTLPWSSVL 
RVWDMFFCEGVKI IPRVGLVLLKHALGS PEKVKACQGQYETIER 
LRSLSPKIMQBAFLVQEWELPVTERQIEREHLIQLRRMQETRG 
ELQCRSPPRIjHGAKAILDAEPGPRPAI*QPSPSIRLPIiDAPtiPGS 
KAKPKPPKQAQKEQRKQMKGRGQLEKPPAPNQAMWAAAGDACP 
PQHVPPKDSAPKDSAPQDLAPQVSAHHRSQESLTSQESEDTYL 


63B6 


819 


195 


TVCGSFYLGIMQRASRLKREIJIMIATEPPPGITCWODKDQMDDL 
RAQI LGGANTPYEKGVFKLEVI IPERYPFEPPQIRFLTPI YHPN 
IDSAGRICLDVLKLPPKGAWRPSLNIATVLTSIQLLMSEPNPDD 
PLMADISSEFKYNKPT^LKNARQWTEKHARQKQKADEBEMLDNL 
PEAGDSRVHNSTQKRKASQLVGIEKKFHPDV 


6387 


1 


662 


PGPTHASADAWADAWAQPNMAMHNKAAPPQI PDTRRELAELVKR 
KQELAETIiANLERQIYAFEGSYLEDTQMYGNIIRGWDRYLTNQK 
NSNSKNDRRNRKE'KKAERLFSKSSVTSAAAVSALAGVQDQLIEK 
REPGSGTESDTSPDPHNQENEPSQEDPEDLDGSVQGVKPQKAAS 
STSSGSHHSSHKKRKNKNRHSPSGMFDYDFEIDLKIiNKKPRADY 


6388 


1 


662 


PGP THAS ADAWADAWAQPNMAMHNKAAPP QI PDTRRELAELVKR 
KQELAETLANLERQIYAFEGSYLBDTQMYGNIIRGWDRYLTNQK 
NSNSKNDRRNRKFKEAERLFSKSSVTSAAAVSALAGVQDQLIEK 
RBPGSGTESDTSPDFHNQENEPSQEDPEDLDGSVQGVKPQKAAS 
STSSGSHHSSHKKRKNKNRHSPSGMFDYDFBIDLKLNKKPRADY 


6369 


1074 


497 


AEPGDRMAGHRLVLVL6DLH I PHRCNSLPAKPKKLL VPGKIQH I " 

LCTGNLCTKESYDYLKTLAGDVHIVRGDFDENLNYPBQKVVTVG 

QFKIGLIHGHQVIPWGDMA3LALLQRQFDVDILISGHTHKFBAF 

EHENKFYIKPGSATGAYNALETWIIPSFVLMDIQASTWTYVYQ 

LIGDDVKVERIEYKKP 


6390 


158 


535 


GEERKEGRAPGKAFAPERNPAKMEKEETTRELLLPNWQGSGSHG 
LriAQRDDGVFVQEVrQNSPAARTGWKEGDQIVGATIYPDNLQ 
SGEVTQLLNTMGHHTVGLKLHRKGDRFP PS LGQTWDP 


6391 


5386 


2897 


VRWNSKTECYLSIQTQENFPANLNELVWCIVISSLVTTQRKLJEiiV 
MSLLGSRNQLARAVLNPNPMDFCTKDLLTTTSERI lAYLRDFNE 
DQKKAIETAYAMVKHSPSVAKICLIHGPPGTGKSKTIVGLLYRL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acxd segment containing signal peptide - 
<A=Alanxne C=Cysteine, D.Aspartic Acid! 
Glutamic Acid, F« Phenylalanine, G^Glycine, 
H-Histidme, I=Isoleucine, K=Lysine 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=rhreonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X^Vnkkown, *«stop 
Codon, /^possible nucleotide deletion, 
\:=possible nucleotide insertion) 


S392 






Ki-iiNQRKGHSDJSNSNAKIKQNRVLVCAPSNAAVDJaLMKKIILEF" 

MKKELPSKVQAMHKRKEFL0YQLDELSRQRALCRGGREIQRQEL 
DENISKySKERQELASKIKEVOGRPOKTOSIIILPQUTTr-r-rT o 

TSGGLLLESAFRGQGGVPFSCVIVDEAGQSCEIETLTPLIHRCN 

!^i;^^y^^^^Q^^^V^SMKAQEYGYDQSMMSJiFCRLLEENVEHN 

MISRLPILQLTVQYRMHPDICLPPSNYVYNRNLKTNRQTEAIRC 

SSDWPFQPYLVFDVGDGSERRDNDSVINVQEIKLVMEIIKLIKD 

KRKDVSPRNIGIITHYKAQKTMIQKDLDKEFDRKGPAEVDTVDA 

FQGRQKDCVIVTCVRANSIQGSIGFLASLQRLNVTITRAKYSLF 

ILGHLRTLMENQHWNQLIQDAQKRGAIIKTCDKNYRHBAVKILK 

L.KPVLQRSLTHPPTIAPEGSRPQGGLPSSKLDSGPAKTSVAASL 

^^'fx^n^^^^^^^'^^^'^^^^'^^^^^Q^PRLLKRMGIEVKQG 

IFLWDPQPSSPQHPGATPPTGEPGFP\A/HQDLSHVQQPAAWAA 

nsSHKPPVRGEPPAASPEASTCQSKCDDPEBELCHRREARAFSE 

GEQEKCGSETHHTRRNSRtroKRTIiEQEDSSSKKRKLL 


6333 


972 
2017 


186 


bRTGVDIASaWAHRLQIRIiLTWDVKDTLLRLRH^UJKAYATKAR 

AHGLEVEPSALBQGFRQAYRAQSHSPPNYGLSHGLTSRQWWLDV 

VLQTFHLAGVQDAQAVAPIAEQLYKDFSHPCXMQVLDGAEDTLR 
ECRTRGLRLAVlSNFDRRLEGII.GCT^T,T?PXJT7npi7T a.o»>. 

KPDPRIFQEALRtAHMEPWAAHVGDNYLCDYQGPRAVGMHSPL 
WGPQALDPWRDSVPKEHILPSLAHLLPALDCLEGSTPrtT. 






730 


uw::^iuviAAVATCGSVAASTGSAVATA5KSNVTSF0RRGPRASVT 

NDSGPRLVSIAGTRPSVRNGQLLVSTGLPALDQLLGGGLAVGTV 

LLIEEDKYWIYSPLLFKYFIAEGIVNGHTLLVASAKEDPAUXLO 

?i:nf^o^''^^^^^^^^^^^^PESNIKMKIAWRYQLLPKME 

IGPVSSSRFGHYYDASKRMPOEr.TPa<:MMuniri7T nr,r,-r« , ,^ 

*'^**^-'"'"*^Wisiji.iiASNWHGFFLPEKISSTLKV 

EPCSLTPGYTKLLQFIQNXIYEEGFDGSNPQKKQRNILRIGION 
LGSPLWGDDICCABNGGNSHSLTKPLYVLRGLLRTSLSACIITM 
PTHLIQNKAIIARVTTLSDWVGLESFIGSERETNPLYKDYHGr. 
IHIRQiPRLNNLXCDESDVKDtAFfCLKRKLFTIERLHLPPDLSD 
TVSR3 5KMDLAESAKRLGPGCGMMAGGKKHLDF 


6395 


i4m 


511 


OAAAKGEGARkkPAAi^TVMAAVAAERAVLEEEKRWLLHDBVHA" 

yLKQLQDILKEASLRFTLPGSGTEGPAKQENFILGSCGTDQVKG 

VLTLQGDALSQADVNLKMPRNNQLLHPAFREDKQWKLQOIODAR 

nhvsqaiylltsrdqsyqfktgaevlio^avmloltSr^lt 

TPATLTLPEIAASGLTRMFAPATiPSDLLVNVYINLNKtiCLTVYQ 

lhalqpnstknprpaggavlhspgamfewgsqrlevshvhkvec 

VIPWLNDALVYFTVSLQLCQQLKDKISVFSSYWSYRPP 


^396 


13 


658 


i:'titiKF-i-J^PLCCAARRGAARHGGSVSGWPAGRYi:>TETSNPGSSVM~ 

esvtfedvavepiqewalldsarrslckyrmldqcrtlasrgtp 

PCKPSCVSQLGQRAEPKATERGILRATGVAWESQLKPEEIiPSMO 
^i^:!5^^^^"^'^^^°^^^^Q^VPSIEBRETPLTREDRPAL0K 
PPWSLGCTGLKAAMQIQRWIPVPTLGHRMPWVARDSnR 


63 97 


1 


1221 

C 
I 
3 
1 


IKESLKQILEESDSRQrFYFLCLNLLFTFVELFYGVLTNSLGLl 
3DGFHMLFDCSALVMGLFAALMSRWKATRIFSYGYGRIEILSGF 
ENGLFLIVIAPFVFMSSVARLIDPPELDTHMLTPVSVGGLIVNL 
CGICAFSHAHSHAHGASQGSCHSSDHSHSHHMHGHSDHGHGHSH 
7SAGGGMMANMRGVFLHVLADTLQSIGVIVSTVLIEQFGWFIAD 
'LCSI.FIAZliIFL5WPriiKDACQVLLLRLPPErYEJCBIiHIAI,EK: 
QKIEGLISYRDPHFWRHSASIVAGTIHIQVTSDVLEQRIVQQV 
j^^LKDAGWNLTIQVEKEAYFQHMSGLSTGFHDVIJ^ 




391 


122 G 
K 


AmjVGRFEAlHAPARMIEWCNDRLGKKVRVKCNTDDriQDLK ' 
LIAAQTGTRWNKIVLKKWYT I FKDHVSLGDYEIHDGMWLELYY 



498 
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SEQ 
ID 
WO: 


Predxcced 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

Segxience 


Arnxno aeia segment containing signal peptide " 
(A=Alanine. C=Cysteine, D=.Aapartic Acid, 
Glutamac Acid, F-Phenylalanine, G^Glycine 
H=Hi3tidine, I=Isoleucine, K^Lysine, 
L=:Leucine, M=Methionine, N=Asparagine 
P^Proline. Q=Glut amine, R=Arginine, 
S-Serine, T=Threonine, V=Valine, 
W^Tryptophan, y=Tyro3ine, X=Unknown, *«Stop 
Codon, /^.possible nucleotide deletion, 
\«possible nucleotide insertion 1 
Q ■ _ 


5398 
6399 


353 


1306 


HKQMGPLINKCKKILLPTTVPPATMRIWLLGGLLPmiXSGLQ- 

RPTEGSEVAIKIDFDFAPGSFDDQYQGCSKQVMEKLTC2GDYFTK 

DIEAQKNYFRMWQKAHLAWLNQGKVLPQNMTTTHAVAILFyTLN 

SNVHSDFTRAMASVARTPQQYERSFHFKYLHYYLTSAIQLLRKD 

SIMENGTLCYEVHYRTKDVHFNAYTGATIRPGQFLSTSLLKEEA 

QEFGNQTLPTlFTCLGAPVQYFSLKKEVLIPPYErjFKVINMSYH 

PRGDWLQLRSTGWLSTYWCQLLKASSKKCIPDPIAIASLSPLTS 
VIIPSKSRV 


6400 


75 


1245 


PNLETYFGKRCEKDSMNE'TPTHTPVCRKRTWS'KRGVAVSGPTK 
RRGMADSLESTPLPSPEDRIAKLHPSKELLEYYQKKMAECEAEN 
EDLLKKLELYKEACEGQHKLECDLQQREEEIAELQKALSDMQVC 
LFQEREHVLRLYSENDRLRIREliEDKKKI^^JLLALVGTDAGEVT 
YFCKEPPHKVTILQKTIQAVQECEQSESSAPKADPKISKRRPSR 
ERKBSSEHYQRDIQTLILQVEALQAOLGEQTKLSREQIEGLIED 
RRIHLEEIQVQHQRNQNKIKELTKNLHHTQELLYESTKDPLQLR 
SENQNKEKSWMLEKDNLMSKIKQYRVQCKKKEDKIGKVLPVMHB 
SHHAQSBYIKVMSLCRNEWYFSGRVEGIPKNLQPVM 


6401 


2520 


1053 


KTMKCDEWYiiVQSAILRHNCGYAMKTGKPPHNLMERKDPETWL 
DNISVTFLSLTDLQKNETLDIILISLSGAVQLRHLSNNLETLLKR 
DPLKLLPLELSPYI^LKWLDPQTLLTCCLVSKQWKKVISACTEVW 
QTACKNLGWQIDDSVQDALHWKKVYLKAILRMKQLEDHEAFETS 
SLIGHSARVYALYYKDGLLCTGSDDLSAKLWDVSTGQCVYGIOT 
HTCAAVKFDBQKLVTGSFDNTVACWBWSSGARTQHFRGHTGAVP 
SVDYFnDELDILVSGSADPTVKVWALSAGTCLNTLTGHTEWVTKV 
VLQKCKVKSLLHS PGDYILLSADKYEIKI WPIGREI NCKCLKTL 
SVSEDRSICLQPRLHFDGKYIVCSSALGLYQWDFASYDILRVIK 
TPEIANLALLGFGDIPALLFDNRYLyiMDLRTESLISRWPLPEY 

RKSKRGSSPLAQEASWLNGLDGHNDTGLVFATSMPDHSIHLVLW 
KEHG 


6402 




765 


PGAAWSRPDLKGCCTGPQPALRMLVLPSPCPQPLAFS5VETMEG 
PPRRTCRSPEPGPSSSIGSPQASSPPRPNHYLLIDTQGVPYTVL 
VDEESQREPGASGAPGQKKCYSCPVCSRVPEYMSYLQRHSITHS 
EVKPFECDICGKAPKRASHLARHHSIHLAGGGRPHGCPLCPRRF 
RDAGEIAQHSRVHSGERPFQCPHCPRRFMEQWTLQKHTRWKHP 


5403 


1196 


279 


TTSQCGGIRQS5AIPVASMEPAAICI,RNALLLLPEEQQDPKQEN 
GAKNSNQLGGNTESSESSETCSSKSHDGDKPIPAPPSSPLRKQE 
LENIiKCSILACSAYVALALGDNLMALNHADKLLQQPKLSGSLKF 
LGHLYAAEAIirSLDRISDAITHLNPENVTDVSIiGISSNEQDQGS 
DKGENEAMESSGKRAPQCYPSSVNSARTVMLFNLGSAYCLRSEY 
DKARKCLHQAASMIHPKEVPPEAILLAVYLBLQNGNTQLALQI 1 
KRHQLLPAVKTHSEVRKKPVPQPVHPIQPIQMPAFTTVORK 


6404 


2 

1012 


1690 

] 
C 
I 
I 
V 

222 -fj 


RGIHTSVLQGNLQNQMYSHNVVIMNLNNLNLTQVQQRNLITNLQ 

RSVDDTSQAIQRIKNDFQNWXJVFLQAKKDTDWLKBKVQSLQTL 
^ANNSAIAKAMWDTLEDMNSnriN*; PTrsnw pm t r e r» » nm^^-Kr^ 

DLQDLHKDABWRTAIKFUQLEERPQLFETDIVNIISNISYTAHH 
DRTLTSNLWEVRTTCTDTLTKHTDDLTSLWNTIANIRLDSVSLR 
viQQDIiMRSRLDTEVAKLSVIMEEMKLVDSKHGQIirKNPTILQGP 
PGPRGPRGDRGSQGPPGPTGNKGQKGBKQEPGPPGPAGERGPIG 
PAGPPGERGGKGSKGSQGPKGSRGSPGKPGPQGPSGDPGPPGPP 
SKEGLPGPQGPPGFQGLQGTVGEPGVPGPRGLPGLPGVPGMPGP 
CGPPGPPGPSGAWPIiALQMEPTPAPEDNSCPPHWKKTFTDKCYY 
^SVEKEIFEDAKLFCEDKSSHLVPINTREEQQWIKKQMVGRESH 
IIGLTDSERENEWKWLDGTSPDYKNV7KAGQPDNMGHGHGPGEDC 
IGIiIYAGQWNDFQCEDVMNFICBKDRETVLSSAL 

JWU^AMAAPAPGIilSVFSSSQEZiGAAIiAQLVAQRAACCnAGARA" 
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SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Aitixno acid segmenc containing signal peptiSe 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, FaPhenylalanine, Gs=Glycine, 
H^Histidine, I=Isoleucine, K=Lysine, 
L-I>eucine, M-Methionine^ N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S;= Serine, T=Threonine, V^Valine, 
W-Tryptophan, Y^Tyrosine, X=Unknovm, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


S405 






RFALGLSGGStiVSMIoARELPAAVAPAGPASLARWTLGFCDERLV 
PFDHAESTYGLYRTHLLSRLPIPESQVITINPELPVEEAAEDYA 
KKLRQAFQGDSIPVPDLLILGVGPDGHTCSLFPDHPLLQEREKI 
VAPISDSPKPPPQRVTLTLPVLNAARTVIFVATGEGKAAVLKRI 
LEDQEENPLPAALVQPHTGKLCWFIiDKAAARIjLTVPPEKHSPIi 




1 


1456 


AALPRPTPRAPLGREGTGSDSEMAASMFYGRLVAVATLRNHRPR 
TAQRAAAQVLGSSGLFNNHGLQVQQQQQRNLSLHEYMSMELLQE 
AGVSVPKGWAKSPDEAYAIAKKLGSKDWIKAQVLAGGRGKGT 
FESGLKGGVKIVFSPEEAKAVSSQMIGKKLFTKQTGEKGRIC3MQ 
VLVCERKYPRREYYPAITMERSFQGPVLIGSSHGGVKIEDVAAE 
TPEAIIKEPIDIEEGIKKEQALQLAQKMGFPPNIVESAAENMVK 
LYSLFL KYDATM IE INPM VEDS DGAVLCMDAKINPD <! w<3 a vn n V 

KI FDUSDWTQEDERDKDAAKANLNTYIGLDGNIGCLVNGAGbAMA 
TMDIIKLHGGTPANFLDVGGGATVHOVTEAFKLiITSDKICVLAlL 
VN I FGG IMRC1> V lAQG I VMAVKDLEI KI PVVVT? T J^flTP vnna v jv 
LI ADSGLKI IiACDDLDEAARMWKLSE IVTLAKQAHVDVKPQLP 


6406 


1036 


167 


HPRgMRGED^i'PEAPPYSSGRYDSIKTEVSGCPEDLTVGRAPTAD 

DDDDDHDDHEDJTOKMWDSEGMDPERLKAFNMPVRLFVDENLDRM 
VPISKQPKEKIQAIIESCSRnFPPPnPPnpmsTTSTVT va r^j^-oKJiv 

KNGMEMTRPTPPHDTSAMAENIIiAAACESETRKAAKRMRLEIYQ 

SSQDEPIALDKOHSRDSAAITHSTYSLPASSYSQDPVYANGGLN 
YSYRGYGALSSNIiOPPASTjnTf3MHQwr3T?cir'T?aoaT aorsni* rx-iT-nr 

CR7VALGSGMGRGKQRPVMBRGCLTA 


5407 


492 


150 


VGLCIiAVSgTVXiAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGV 

SWYQQRAGSAPRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNAG 
VLTI S PVQPEDDADYYCSVGVOPq P 


6406 


1458 


903 


RGCI TSSQAWRLFGG VTRGFNMRI EKCYFCSGPI YPGHGMMFVR 

NDCKVFRFCKSKCHKNFKKKRNPRKVRWTKAFRKAAGKELTVDN 

SFEFEKRRNEPIKYQRELWNKTIDAMKRVEEIKQKRQAKFIMNR 

LKKNKELQKVQDIKEVKQNIHLIRAPIAGKGKOLEPlfMVnriT ni? 
DVDMEDAP IHI^IRAPLAGKGKQLEEKMVQQLQE 


6409 


150 


446 


NTALANLLRCFTCDRLCGGCTAPAPPAHQGIVLQPVMPSCDPGP 

GPACLPTKTFRSYLPRCHRTYSCVHCRAHLAKHDELISKSFQGS 
HGRAYLPNSV 


6410 


85 


607 


RGGTAGCVACIiGCWGQSSSPKAAFPAGSACLPADSCPCLLFQAC 
AlSGLPNCITIHPLNIAAGVWMIMNAFlLLEiCEAPFCCQFIEPA 
NTVAEKVDRLRSWQKAVFYCGMAWPIVISLTLTTLLGNAXAPA 
TGVLYGLSALGKKGDAISYARIQQQRQOADEEglASTr.RnTgT. 


6411 
6412 


302 


772 


RLSIMASSLNEDPEGSRITYVKGDLPACPKTDSLAHCISEDCRM" 
GAGIAVLFKKKFGGVQELI,KrQQKKSGEVAVLKRDGRYiyYLITK 
KRASHKPTYENLQKSLEAMKSHCLKlTGVTDLSMPRrGCGLDRLQ 
WENVSAMIEBVFEATDIKITVYTL 




61 


1709 

h 


RPVTSFSPLPGSCGGRLGTRTMLGliSiREVSAALKyGmTPTBL 
CX3KCLSLIKKTKPLHAYITVSEEVA1iKQAEBSEKRYKNGQSLGD 
LDGIPIAVKDNFSTSGIBTTCASNMLKGYIPPYNArWQKLLDQ 
GALLMGKTNLDEFAMGSGSTDGVFGPVKNPWSYSKQYREKRKQN 
PHSENEDSDWLITGGSSGGSAAAVSAFTCYAALGSDTGGSTRNP 
AAHCGLVGFKPSYGLVSRHGLIPLVNSWDVPGILTRCVDDAAIV 
LGALAGPDPRDSTTVHEPUSTKPFWLPSLADVSKLCIGIPKEYLV 
PELSSEVQSLWSKAADLFESEGAKVIEVSLPHTSYSIVCYHVLC 
TSEVASNMARFDGLQYGHRCDIDVSTEAMYAATRREGFNDWRG 
RILSGNFFLLKENYENYPVKAQKVRRLIANDFVNAFNSGVDVLL 
rPTTLSEAVPYLEFIKEDNRTRSAQDDIPTQAVNMAGLPAVSIp 

yALSNOGLPIGIjQFIGRAFCXJQQLLTVAKWFEKQVOPPVIQLQE 
L/MDDCSAVLENEKIiASVSLKQ 
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SEQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{AaAlanine, C=Cysteinei D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=:Histidine, I=Isoleucine, KaLysine, 
Ij= Leu cine, M=:Methionine, N=Asparagine , 
?=Proline, Q:=Glutamine, R«Arginine, 
S=Serine, T=Threonine, V»=.Valine, 
W^Tryptophan, Y=Tyro3ine, X=UnkJiovm. *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6413 


2 


885 


HEPRCAGMAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIR 
IJRLTGI PAGYCFVEFADLATAEKCLHKINGKPLPGATPAKRPKL 
NYATYGKQPDMSPEYSLFVGDLTPDVDDGMLYEFFVKVYPSCRG 
GKWLDQTGVSKGYGFVKFTDELEQKRALTECQGAVGIiGSKPVR 
LSVAIPKASRVKPVEYSQMYSYSYNQYYQQYQNYYAQWGYDQNT 
GSYSYSYPQYGYTQSTMQTYEEVGDDALEDPMPQIiDVTEANKEF 
MEQSEBLYDALMDCHWQPLDTVSSEIPAMM 


6414 


1 


53a 


RGGRAAIiLPWRRFPCCR PR PQPAR PSSRATPGPRS PGMATS I GV 
SFSVGDGVPEAEKNAGEPENTVri.RPUJPfViTO PDDC\nrvnriTuii.t7 
LKHELANAEYSPEBMPQLTKHLSENIKDKLKBMGFDRYKMWQV 

VIGEQRGEGVFMASRCFWDADTDNYTHDVFMNDSLFCWAAFGC 
FYY 


S415 


2 


116B 


FVRQWQS SHRRACGLGCEARAGGGEBPRGRAS S VAGWVGAFRAP " 

FXEAAVAGLGAGSG2CRRRGWKMPVHSRGDKKETNHHDEMEVDYA 

ENEGSSSEDEDTESSSVSEDGDSSSiVIDDEDCERRRMECLDEMSN 

LEKQFTDLKDQLYKER1.SQVDAKLQEVIAGKAPEYLEPLATLQE 

NMQIRTKVAGIYRELCLESVKNKYECEIQASRQHCESEKLLLYD 

TVQS ELS EK IRRLEEDRHS I D ITS ELWND PT.n If n Dww on 

KKKPGWSGPYIVYMLQDLDILEDWTTIRKAMATLGPHRVKTEP 
PVKLEKHLHSARSEEGRLYYDGEWYIRGQTICIDKKDECPTSAV 
ITTINHDE VWFKRPDGS KSKLYISQLQKGKYS r KHS 


6416 


410 


1519 


EI APADIiE IPACTVPVLLSRATSSTMS VTGGKMAPSLTQE ILSHli " 

GIiASKTAAWGTLGTLRTPLNFSVDKDAQRLLRAITGQGVDRSAI 

VDVLTNRSREQRQLISRWFQERTQQDLMKSLQAALSGmjERIVM 

ALLQ?TAQPDAQELRTAI»KASDSAVDVAIEILATRTPPQLQECL 

AVYIOnsTFQVEAVDGITSETSG ILODIiLLALAKOGRDS YSGI IDY 

NLAEQDVQALQRAEQPSREETWVPVFTQRNPEHIiIRVFDQYQRS 

TGQELEEAVQNRFHGnAQVALLGLASVIKNTPLYFADKLHQALQ 

ETEPNYQVLIRILISRCETDLLSIRAEFRKKFGKSLYSSLQDAV 

KGDCQSAIiLALCRAEDM 


6417 


1 


845 


RGESRVLWS ELEGEAGGAGGWAS SLKTARMDNRPATAFVIACVLS 
LXSTIYMAASIGTDFWYEYRSPVQEKTSSDLNKSIWDEFISDEAD 
EKTYNDALFRYNGTVGLWRRCITIPKNMtlWYSPPERTBSFDWT 
KCVSFTLTEQFMEKFVDPGNHNSGIDLLRTYLWRCQFLLPFVSL 
GLMCFGALIGIiCACICRSL YPTIATG ILHLIAGLCTLGSVSCYV 
AGIELLHQKLELPDNVSGEFGWSFCLACVSAPLQFMASALFIWA 
AHTNRKEYTLMKAYRVA 




6418 


2 


662 


TRTRPRRPPGLGAAVGKAGARSTSTPAGASPAAAYQADPPPPAH 
TPAPPPPPPCGGI ACHOEPAKFYGYDNLQRQP I PTTQQEAEIjVQ 
YPDCKSSSGNI GBDPDHLNQSSS PS QMFPWMRPQAAPGRRRGRQ 
TYSRFQTIiELEKEFLFNPYIjTRKRRIEVSHALALTERQVKIWFQ 
NRRMKWKKENNKDKPPVSRQEVKDGETKKEAQELEEDRAEGLTN 




6419 


1 


S73 


PGRPRVRNFDUJTSKS ILQEPFCTRS IQI PANRSKtAMSKCPI PP 

marsistsgpldkbdtgrqklistgslpatlqgatdsuslewhl 

PSPDPVTVPYLS PLWWKELESLLENEODHAI T VADFVDHHP IV 

fwklv57yfrrldlpsnlpglilssbhcnkvsk i prhcmseds ky 
vl iq^3lwdnmklhqdpgqpiiyilwnahtqkyp^ivhllqksdnsp 
nqelriksmvksikmndvygpmsqiletlnkcphfkrqrslyrei 
lflslvalgrenididafdkeykmaydrltpsqvksthncdrpp 
stgvmecrktfgepyl 




6420 


207 

J- 


1187 


RKMIDKWQTCGVGQDSVPYMICLIHILEEWFGV^C^LEbYLNFAN 
YLLWVFTPLII.LIIjPYFTIFLLYLTIIFLHIYKRKNVLKEAYSH 
NLWDGARKTVATLWDGHAAVWHGYEVHGMEKIPEDGPALIIPYH 
GAI P IDFYYFMAKI FIHKGRTCRWADHFVPKIPGFSLCiLDVFC 

alhgprekcveilrsghlriai spggvrbali sdetyni vwghrr 
gfaqvaidakvpiipmftqwiregfrslggtrlfrwlyekfryp 
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SEQ 
ID 
NO: 


Predicted 
beginnincf 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
[A;=Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G-Glycine, 
Hs:Histidine, I=Isoleucxne, K=Lysine, 
L=Leucine, M=Methionine, N^Asparagine 
P=Proline, Q^Glutamine, R=Arginine, ' 
S=Serine, T=Threonine, V=Valine, 
W:=Tryptophan, Y-Tyrosine, X^Unknown, *=Stop 
Codon, /-.possible nucleotide deletion, 
\=possible nucleotide insertion) 


6421 






*APMYGGFPVKLRTyi/3DPIPYDPQlTAEELAEKTKNAVQALlD 
KHQRIPGNIMSALLERFH 


6422 


1844 


362 


WAi.SI.RRQPEJ2MSWKLLSPHP«SVVLRSEFKMASSPAVLRASRL 

YQWSLKSSAQFLGSPQLRQVGQIIRVPARMAATLILEPAGRCCW 

DEPVRIAVRGLAPEQPVTLRASLRDEKGAI^FQAHARYRADTLGE 

LDLERAPALGGSFAGLEPMGLLWALEPEKPLVRIiVKRDVRTPIA 

VELEVIiDGHDPDPGRLLCQTRHERYFLPPGVRRBPVRVGRVRGT 

LFLPPEPGPFPGIVDMFGTGGGLLEYRASLLAGKGFAVMALAYY 

NYEDLPKTMETLHLEYFEEAMNYLLSHPEVKGPGVGLLGISKGG 

ELCLSMASFLKGITAAWINGSVANVGGTLRYKGETLPPVGVNR 

NRIKVTKDGYADIVDVLNSPLEGPDQKSFIPVBRAESTFIiFLVG 

ODDHNWKSEPYANBACKRLQAHGRRKPQIICYPBTGHYrEPPYF 

PLCRASLHALVGSPIIWGGEPRAHAMAQVDAWKQLQTFFHKHLG 
GRBGTIPSKV ^"^^nxj^ 




181 


2133 


LGENLSWFOKywUDIAKBFYWKTPCPGpyLRYMFDVTKGKIFIE" 

WMKXSATTNICYNVLDRNVHEKKLGDKVAFYWEGNEPGETTQITY 

nyAjLiVgvCQFSNVURKQGlHKGDRVAlYMPMIPELWAMIiACAR 

IGAIiHSIVFAGFSSESLCERILDSSCSLLITTDAFYRGEKLVNL 

KELADEALQKCQEKGFPVRCCIWKHLGRAEIiGMGDSTSQSPPI 

KRSCPDVQISWNQGIDLWWHELMQEAGDECEPEWCDAEDPLFIL 

YTSGSTGKPKGWHTVGGYMLyVATTFKYVFDFHAEDVFWCTAD 

J.vjwi iiarttoX VTYGPIiANGATSVLFEGIPTYPDVNRLWSXVDKYK 

VTKFYTAPTAIRLLMKFGDEPVTKHSRASLQVLGTVGEPINPEA 

WliWYHRWGAQRCPIVDTPWQTETGGHMLTPLPGATPMKPGSAT 

FPPFGVAPAILNESGEELEGEAEGYLVFKQPWPGIMRTVYGNHE 

RFETTYFKKPPGYYVTGDGCQRDQDGyywiTGRIDDMLNVSGHL 

LSTAEVESALVEHEAVAEAAWGHPHPVKGECLYCFVTLCDGHT 

FSPKLTBELKKQIREKIGPIATPDYIQNAPGLPKTRSGKIMRRV 

LRKIAQNDHDLGDMSTVADPSVISHLPSHRCLTIQ 


6423 
6424 


614 

1 


1237 


AWI.K£IPRDI,l^FaTVI.IlYI.DS^I^lTSU•^NEIFKDLHQLRVLNLS~ 
*vi,,wr A c c AucxiAr I Liy J, JjDIiS dnri qs VHKNAFIINLKARA 

RIAWNPWHCDCTLQQVLRSMASNHETAHNVICKTSVLDEHAGRP 
PLNAANDADIiCNLPKKTTDYAMLVTMFGWFTMVISYVVYYVRON 
QEDARRHLEYLKSLPSRQKKADEPDDISTW 


6425 




1186 


KKVaWPVAAMVHCSCVLFKKyGNFIDKLRLFTRGGSGGMGYPRL- 

GGEGGKGGDVWWAHNRMTLKQLKDRYPRKRFVAGVGANSKISA 
liKGSKGKDWEI PVPVGISVTDENGKIIGETjMmrijnDTT \r-i^f>r*r^r 

GGKLLTNFLPLKGQKRIIHLDLKLIADVGLVGFPNAGKSSI^LSC 
VSHAKPAIADYAFTTLKPELGKIMYSDFKQISVADLPGLIEGAH 
MNKGMGHKPLKHIERTRQLLPWDISGFQLSSHTQYRTAFETir 
LLTKELELYKEELQTKPAliLAVNKMDLPnAQDKFHSLMSQLQNP 
KDFLHLFEKNMIPBRTVEFQHIIPISAVTGEGIEELKNCIRKSL 
DEC2AKQENnALHKKQLLNLWr SDTMSSTEPPSKilAVTTSKMDr T 


6426 


1850 


1144 

: 


LAMfiOGGGIPHi:Tl.KBE5QSRHVLPASFEVNSi;UKSWWGFLLTG"" 

u viaur 1 VAVlfAVATPFVTPALRKVCLPFVPATMKQIENWKMLR 

CRRGSLVDIGSGDGRIVIAAAKKGPTAVGYELNPWLVWYSRYRA 

WREGVHGSAKFYISDLWKVTFSQYSNWIFGVPQMMLQLEKKLE 

RELEDDARVXACRFPPPHWTPDHVTGEGrDTVWAYDASTPRGRE 
KRPCTSMHPQLPIQA 


6427 


30 


565 i 
] 
1 
I 


gRGAAVGGMSVAGGSIRGDTGGEDTAAPGRFSFSPKPTLEDIRR- 
.HAEFAAERDWEQFHQPRNLLLALVGEVGBLAELFQWKTDGEPG 
sQGWSPRERAALQEELSDVLIYLVAIiAARCRVDLPlAVLSKMDI 
RRRYPAHLARSSSRKYTBLPHGAISEDQAV6PAD1 PCDSTGQT 




145 


953 J 


LASWQPPHVPKAGKMVSWMICRLWLVFGMLCPAYASYKAVKTK " 
riREYVRWMMYWIVFALFMAAEIVTDIFISWFPFYYBlKMAFVL 
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SEQ 
ID 
NO: 


Predicted 
besinning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A«Alanine, C=Cysteine, D«Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H^Histidine, I«l3oleucine, K=Lysine, 
L^Leucine, M=Methionine, rT==Asparagine, 
P=Proline, Q=Glutamine, RoArginine, 
S=Serine, T=Threonine, V^*. Valine, 
W^Tryptophan, Y^Tyrosine, X-Unknown, *=Stop 
Codon, /^.possible nucleotide deletion, 
\=possible nucleotide insertion) 








WLLSPYTKGASIiIiYRKFVHPSLSRHEKEIDAyiVQAKERSYETV"' 

LSFGKRGtiNIAASAAVQAATKSQGAliAGRI*RSFSMQDLRS ISDA 

PAPAYHDPLYLEDQVSHRRPPIGYRAGGLQDSDTEDECWSDTEA 

VPRAPARPREKPLIRSQSLRWKRKPPVREGTSRSLKVRTRKKT 
VPSDVDS 


&428 


1982 


444 


SGSGGKMEDHQHVP IDIQTSKLUDWLVDRRHCSIiKWQSLVLTlR " ' 
EKINAAIQDMPESEEIAQIjLSGSYIHYFHCLRILDIJDKGTEAST 

knif6ryssqrmkdwqeiialyekdntylvelssllvrnvnyei 
pslkkqiakcqqlcobysrkeeecqagaaemreqfyhsckqygi 
tgenvrgellalvkdlpsqlaeigaaaqqslgeaidvyqasvgf 
vcespteqvlpmlrfvqkrgnstvyewrtgtepswerphleel 
peqvaedaidwgdfgveavsegtdsgisabaagidwgifpesds 
kdpggdgidwgddavalqitvleagtqapeqvargpdaltlley 
tetrnqpldelmbleiflaqravelseeadvlsvsqfqlapail 

QGQTKEKMVTMVSVIiEDLIGKLTSIiQLQHLFMlLASPRYVDRVT 
EFJjQQKLKQSQLIiALKKELMVQKQQEALEEQAAliEPKLDLLLEK 

tpcelqklieadiskrysgrpvnlmgtsl 


6423 


3413 


3442 


EPSSWTAAPRGPIAAHPLEAAVQEDDRRALSFDSRIKVFANGTtT" 

wksvtdkijagdylcvarnkvgddywlkvdvvmkpakiehkee 

NDHKVFYGGDLKVDCVATGLPNPEISWSLPDQSLVNSPMQSDDS 
QGRTKR YWPNNGTIi YFNE VGMREEfm YTr pa ■FTjn\>r' xmirMTM m 

VKWTAPATIRNKTCLAVQVPYGDWTVACEAKGBPMPKVTWLS 
PTOKVIPTSSEKYQIYQDGTLLIQKAQRSDSGNYrci.VRNSAGE ' 

drktvwihvnvqppkingnpnpittvreiaaggsrklidckaeg 

IPTPRVLWAFPEGWLPAPYYGNRITVHGNGSLDIRSLRKSDSV 
QLVCMARNEGfeEARLIVQLTVLEPMEKPIFHDPISEKITAMAGH 
TISLNCSAAGTPTPSLWfVLPNGTDLQSGQQLQRFYHKADGMLH 
ISGLSSVDAGAYRCVARKAAGHTERLVSIjKVGIiKPEANKQYHNL 
VSIINGETLKLPCTPPGAGQGRFSWTLPNGMHLEGPQTLGRVSL 
LDNGTLTVRSASVPDRGTYVCRMETEYGPSVTS I PVIVIAYPPR 
rTSEPTPVT:YTRPGNTVKLNCMAMGlPKADITWELPDiCSHi:.KAG 

VQARLYGNRFLHPQGSLTIQHATQRDAGFYKCMAKNIIjGSDSKT 
TYIHVF 


6430 


1546 


602 


RTRVSTGLRRTLLWSEAVGASSTRGDTGIPGSGEGGAGPGGGEG 

amleamaepspedppptlkpetqppekrrrtiedfnkfcsfvla 
yagyippskeesdwpasgsssplrgbsaadsdgwpsapsdlrti 
qtfvkkaksskrraaqagptqpgpprstfsrlqapdsatllekm 
klkdslfdldgpkvasplsptslthtsrppaaltpvplsqgdls 

HPPRKKDRKNRKLGPGAGAGFGVLRRPRPTPGDGEBaiSRIKKSK 

jcrklkkaergdrlpppgppqappsdtdseeeeeeebeeeebbma 

TWGGEAPVPVLPTPPEAPRPPATVHPEGVPPADSESKEVGSTE 

TSQDGDASSSEGEMRVMDEDIMVESGDDSWDIilTCYCRKPPAGR 

PMIECSLCGTWIHLSCAKIKKTNVPDFFYCQKCKELRPBARRLG 
GPPKSGEP 


6431 


3 


605 


WWNSSYKLPAYAPYLPCEACAMQDGRKGGAYAGKMEATTAGVGR" 

LBEEALRRKSRLKALREKTGRKDKEDGEPKTKHLREEEEEGEKH 

RELRLRWYVPEDEDLKKRRVPQAKPVAVEEKVKEQLBAAKPEPV 

lEEVDIANLAPRKPDWDLKRDVAKKLEKLKKRTQRAIAELIRER 

LKGQEDSIiASAVDAATEQKTCDSD 


6432 


56 


1692 


GGLGTMGSRIKQNPEXTFEVYVEVAYPRTGGTLSDPEVQRQFPE 
DYSDQEVLQTLTKFCFPFYVDSIiTVSQVGQKFTFVLTDIDSKQR 
FGFCRLSSGAKSCFCILSYLPWPEVFYKLLNILADYTTKRQENQ 
WNELLETLHKLPI PDPGVSVHLSVHSYPTVPDTRELPS I PENRN 
LTE YFVAVDVNNMLHI,YAS^^LYBRRI LI I CSKLSTLTACIHGSA 
AMLYPMYWQHVYIPVLPPHLLDYCCAPMPYLrGIHLSLMEKVRN 
MALDDWiLNVDTNTIiETPFDDLQSLPNDVISSLKNRLKKVSTT 
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PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

nucleotide 

location 

to first 
amino acid 
residue of 
amino acid 
setjuence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine. D^Aspartic Acid, 
Glutaraic Acid, F= Phenylalanine, G=Glycine 
H=Hi3tidine, I^Isoleucine, K^Lysine, 
L=Iieucine, M«Methionine, N=Asparagine , 
P=Proline, Q=^Glut amine, R=Arginine, 
S^Serine, T=Threonine, V^Valine, 
W=.Tryptophan, Y=Tyrosine, X»Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
_ \ -possible nucleotide insertion) 








TGDGVARAt'liKAQAAFFGSYRNALKIEPEBPITFCEEAFVSHYR 
SGAMRQFLQNATQLQIiFKQFIDGRLDLLNSGEGFSDVFEEEIWM 
GBYAGSDKLyHQMLSTVRKGSGAILNTVKTKAWPAMKTVYKPDI 
AENGCAPTPEEQLPKTAPSPLVEAKDPKLREDRRPITVHFGQVR 

PPRPHWKRPKSNIAVEGRRTSVPSPEQNTIATPATLHILQKSI 
TKFAAKFPTRGWTSSSH 


6433 
6434 


1S24 


4S4 


APVTKRKEVFAKDSKGSALDAGRDPKRPALPETLCESGWASNTA 
PTTPPQPGWCLCGKDFKSSCQTPGREKERRIATMHGSCSFLMLL 
LPLLLLLVATTGPVGALTDEEKRLMVELHNLYRAQVSPTASDML 
HKRWDEELAAFAKAYARQCVWGHNKERGRRGENLFAITDEGMDV 
PLAMEEWHHEREHYNLSAATCSPGQMCXSHyTQWWAKTERIGCXS 
SHFCEKLQGVEETNIELLVCJTYEPPGNVKGKRPYQEGTPCSQCP 
SGYHCKNSLCEPIGSPEDAQDLPYLVTEAPSFRATEASDSRKMG 
AEGPDKPSWSGLNTSGPGHVMGPLLGIjZjLLPPLVLAGI f 


6435 


40 


2002 


MPQr^NFGMADPTQMGGIjSMLrjLAGEHALGTPEVFSGTCRPDVSE 

spelrqksplfqfaeissstshsdastkqcqtsalfqfaeissn 

TSQLGGAEPVKRCGKSALFQJjAEMCIiASEGMKMEESKLIKAKES 

dggrikelekgkeekeikmektdetrlqkeaefeksakenlrds 
kelrnfealqiddimaikmedpkeirkeeleedhkcshppdfsy 

SASSKIIISDVPSRKDHMCHPHGIMIIEDPAALNKPEKLKKKKK 
KSKMDRHGNDKS TPKKTCKKRQSSESD r BSVI YTIEAVAKGDWG 
lEKLGDTPRKKVRTSSSGKGSII^KPPKKXVKSRBKKMSKEKS 
SDTTKESRPPDFISISASKNISGETPBGIKAEPLTPMEDAIiPPS 
LSGQAKPEDSDCHRKIETCGSRKSERSCKGALYKTLVSEGMLTS 
liRANVDRGKRSSGKGNSSDHEGCWNEESWTFSQSGTSGSKKPKK 
TKPKEDCLLGSAKIiDEBFEKKPKrSLPQYSPVTPDRKCVPVPRKK 
KKTGKVSSEPTKTSKGSGDKWSNKQLPLDAIHPTEAIFSEDRNT 
MEPVHKVKNIPSIFNTPEPTTTARTFGGQPKEKSKENPDYSPCQ 
DTQRAGYHHEEVLWMTNLMNWCGGVYLKQLRHTAMTNA 


6436 


2227 


657 


ALQRDAAAAYAHPEYEERFLQEETVSQQINSIELLQTRPLALPE 
WKSQRPLQRQVHLRGRPAS0PTVIRGITYYBCAKVJ5EEENDIEE 
QQDEFFSGDNGVDLIilEDQLLRHNGLMTSVTRRPAATRQGHSTA 
VTSDLNARTAPWS SALPQPSTSDPS lANHASVGPTLQTTS VS PD 
PTRBSVLQPSPQVPATTVAHTATQQPAAPAPPAVSPREALMEAM 
HWPVPPTTVRTDSLGKDAPAGRGTTPASPTLSPBEEDDIRNVI 
GRCKDTLSTITGPTTQNTYGRNEGAWMKDPLAKDERIYVTNYYY 
GNTLVEFRNLENFKQQRWSNSYKLPYSWIGTGHWYNGAFYYNR 
AFTRNI IKYDLKQRY VAAWAMLHDVAYEEATPWRWQGHSDVDFA 
VDBNGLWLIYPALDDEGFSQEVIVLSKLNAAD^iSTQKETTWRTG 
LRRNFYGNCFVICGVLYAVDSYNQRUANISYAFDTHTNTQIVPR 
LLPENEYFYTTQIDYNPICDRLLYAWDNGHQVTYHVIFAY 


6437 


1295 ■ " 


341 


GACRPPVRQDPDSGPDYEALPAGATVTTHMVAGAVAGILEHCVM 

YPIDCVKTRMQSLQPDPAARYRNVLEALWRIIRTEGLWRPMRGL 

NVTATGAGPAHALYFACYEKLKKTLSDVIHP6GNSHIANGAAGC 

VATLLHDAAMNPAEWKQRMQMYNSPYHRVTDCVRAVWQNEGAG 

AFYRSYTTQLTMNVPFQAIHFMTYEFLQEHFMPQRRYWPSSEIVL 

SGACAGAVAAAATTPLDVCKTLLWTQESI.ALNSHITGHITGMAS 

AFRTVYQVGGVTAYFRGVQARVIYQIPSTAIAWSVYEFFKYLIT 
KRQBEWRAGK 




1828 


360 

: 


PPAPAPPASPAKHVTRTARGHLEGGSRAPPLI^QAVFLQIKNMVK 
blHTLADHGDDVNCCAFSFSIiLArCSriDKTIRLYSLRDFTELPH 
SPLKFHTYAVHCCCFS PSGHILAS CSTDGTTVLWNTEWGQMIAV 
^OPSGSPVRVCQFSPDSTCLASGAADGTWLWWAQSYKLYRCG 
3ViaDGSIAACAFSPNGSFPVTGSSCGDLTVWDDKMRCX.ESEKAH 
DLGITCCDFSSQPVSDGEQGLQFPRLASaSQDCQVKIWlVSFrH 
CLGFELKYKSTLSGHGAPVLACAFSHDGQMLVSGSVDKSVIVYD 
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PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptiHT" 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=: 
Glutamic Acid, F==Phenyl alanine. G=Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6438 






TKTENHjHTLTQHTRYVTTCAFAPNTLLIiATGSMDKTVNIVfQFD 

IiETIiCQARSTEHQLKQFTEDWSEBDVSTWLCAQDLKDLVGIFKM 

NNIDGKBLLNLTKESLADDLKIESLGliRSKVLRKIEELRTKVKS 

LSSGIPDEPICPITRELMKDPVIASDGYSYEKEAMENWDPAKRN 
RTSPP 


6439 


109 


901 


EVQILRAKMFQTGGLIVPYGLLAQTMAOFGGLPVPLnnTT.pr.w 
NPAI1PLSPTGIAGSLTNAI.SNGLLSGGLLGILENLPLLDILKPG 
GGTSGGLLGGLLSKVTSVIPGLNNIIDIKVTDPQLLELGLVQSP 
DGHRLYVTI PLGI KLQVNTPLVGAS LLRLAVKLD ITAE ILAVRD 
KQERIHLVLGDCTHSPGSLQISLLDGLGPLPIQGLLDSLTGILN 
KVLPELVQGNVCPLVNEVLRGLDITLVHDIVNMLIHQLOFVIKV 


6440 


23 


412 


SIQTASAITTEMASQSQGI QQI^LQAEKRAAEKVADARKKKARRL 
KQAKEEAQMEVEQYRREREHEFQSKQQAAMGSQGNLSAEVEQAT 
RRQVQGMQSSOQRNRERVLAQLLGMVCDVRPQVHPNYRISA 


6441 


3 


517 


RARWNSDMGDTjPRTiVRT.Q T aT.PTr<PXTnr^-DT>p\/wTw/-iA-i«.^-.>^v — 

JWA-»Ajrv>u vKjji J.'ilJKiyfWU^yir'Vl? X ItVlXiQRPGQNRT 

IKLLTGSSYKVEVKIKPSTLQVENISIGGVLVPLELKSKEPDGD 
RWYTGTVDTEGVTPTKSGERQPIQITMPFTDIGTFSTVMQVKF 
YNYHKRDHCQWGSPPSVIEYECKPNETRSLMWVNKESFL 


6442 


234 


1373 


KSGGI^RRRQRPGRSAAVGEEELPPGMEKFKAAMLLGSVGDALGY 
RNVCKENSTVGMKIQEELQRSGGLDHbVLSPGEWPVSDNTIMHI 
ATABALTTDYWCIiDDL YREM VRCYVE IVEKr.PVH»PnnA'PTr'/>/-» 

AQLKPNWYLLAWHTPFNEKGSGFGAATKAMCIGLRYWKPERLET 
LIEVSVECGRMTHNHPTGPLGSLCTALPVSFAAQGKPBVQWGRD 
MLRAVPIAEEYCRKTIRHTAEYQEHWFYFEAKWQFYLESRKISK 
DSENKAIFPDNYDAEBREKTYRKWSSEGRGGHRGHDAPMIAYDA 
LLAAGNSWTELCHRAMFHOGESAATGTIAGCLFGIiLVfST.nT.VDw 
GLYQDLEDKEKLEDLGAALYRLSTEEK 


6443 


34 


796 


AEDPAGGIAGQDTMFARGLKRKCVGHEEDVEGALAGLKTVSSYS 
LQRQSLLDMSLVKLQLCHMLVEPNLCRSVLIANTVRQIQEEMTQ 
DGTWRTVAPQAAERAPLDRLVSTEILCRAAWGQEGAHPASGLGD 
GHTQGPVSDLCPVTSAQAPRHLQSSAWEMDGPRBNRGSFHKSLD 
QIFETLETKNPSCMEEDFSDVDSPYYDLDTVLTGMMGGARPGPC 
EGLEQLAPATPGPS SS CKSPIiGBLDKWE I LVET 


6444 


2 


555 


MASPAASSVRPPRPKKBPQTLVIPKWAAEEQKLKLERLMKNPDK 

AVPI PEKMSEWAPRPPPBFVRDVMGSSAGAGSGEFHVYRHLRRR 

EYQRQDYMDAMAEKQKLDAEFQKRLEKNKrAAEBQTAKRRKKRQ 

KLKEKKLIAKKMKLEQKKQEGPGQPKEQGSSSSAEASGTEEEEE 
VPSFTMGR 


6445 


390 


899 


GyrPRGKMRAPIPEPKPGDLIEIPRPFYRHWAXYVGDGYVVHLA 
PPSEVAGAGAASVMSALTDKAIVKKELLYDVAGSDKYQVNNKHD 
DKYSPLPCSKI IQRAEELVGQBVLYKLTSENCEHFVNELRYGVA 
RSDQVRDVI XAASVAGMGLAAMSLIGVMFSRNKRQKQ 


6446 


2 


753 


AGiUiUAAGAAkSFRPQAHTKGVRGLPSRRRSPDCGRMEJbAAGSP 
SEEQFWEACAELQQPAIjAGADWQLLVETSGISIYRLLDKKTGIiY 
BYKVFGVLBDCSPTLIADIYMDSDYRKQWDQYVKELYEQECNGE 
TWYWEVKYPPPMSNRDYVYIJIQRRDLDMEGRKIHVILARSTSM 
PQLQBRSGVIRVKQYKQSIiAIESDGKKGSKVFMYYFDWPGGQIP 
SWLINWAAKNGVPNFLKDMARACQNYLKKT 




1 


1651 

} 
( 
] 


KUi'Xi<bi'PPj:)TPGSRGTTAMCSLASGATGGRGAVENEEDi;PE!:S~ 

DSGDEAAWEDEDDADLPHGKQQTPCLFCNRbFTSAEETFSHCKS 

EHQFWIDSMVHKHGLEPYGYIKLINFIRLKWPTVEYMNSIYNPV 

PWEKEEYLKPVLEDDLLLQFDVBDLYEPVSVPFSYPWGLSENTS 

/VEKLKHMEARALSAEAALARAREDLQKMKQFAQDFVMHTDVRT 

:SSSTSVIADLQEDEDGVYFSSYGHYGIHEEMLKDKIRTESYRD 

=;IYQNPHIFKDKWLDVGCGTG1LSMFAAKAGAKKVLGVDQSEI 
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PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
ainino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acxd segment containing signal peptlH^ 
{A=.Alanxne, C=Cysteine, D^Aspartic Acid. B= 
Glutamic Acid. F=Phenylalanine, G-Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M== Methionine, N^AsparagiAe, 
P=Proline, Q«Glutamine, R^Arginine, 
SaSerine, T==Thr eonine , V=Valine, 
"^Tryptophan, Y^Tyxosine, X=Unknown. *=stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6447 






I^YQAMDIIKUMKLEDTITLIKGKIEEVHLPVEKVDVIISEWMGY 
FliLFESMLDSVLYAKmCYLAKGGSVYPDICTISLVAVSDVNKHA 
DRIAFWDDVYGFKMSCMKKAVrPEAWEVLDPKTLlSEPCGIKH 
IDCHTTSISDLEFSSDFTLKITRTSMCTAIAGYFDIYFEKNCHN 
RWFSTGPQSTKTHWKQTVFLLEKPPSVKAGEALKGKVTVHKNK 
_ KDPRSLTVTLTLNNSTQTYGLQ 


6448 


1554 


1068 


KLGPAEWHLSGPCHATLGAANRGRALGVRAAWRGAPLCQRVMMP ' 
SRTNLATGIPSSKVKYSRLSSTDDGYIDLQFKKTPPKIPYKAIA 
LATVLFLIGAPLI I IGSLLLSGYISKGGADRAVPVLI IGILVFL 
PGFYHLRIAYYASKGYRGYSYDDIPDFDC 


6449 


74 


559 


GQVLSHCYHYRSSRWRRGGLSRGRGAGVMALVPYEHTTEFGLQK 
FHKPLATFSPANHTIQIRQDWRHLGVAAWWDAAIVliSTYLEMG 
AVELRGRSAVELGAGTGLVGIVAALLACRIRYERDKNFIaAMLER 
QFXVRKVHYDPEKDVHIYEAQKRNQKEDL 


6450 


597 


1876 


EYGVCENLRKLKiTGVSCRDVVAKLLHRYRri^LGLWQPDIGPYG- 

GLLNVWDGLPIIGWMYLPPHDPHVDDPMRFKPLFRIHLMERKA 
ATVECMYGHKGPHHGHIQIVKKDEFSTKnPfOTnuiTOMQrir'or^T^c. 

FRTWLREBWGRTLEDIFHEHMQELILMKPIYTSQYDNCLTYRRI 

YLPPSRPDDLIKPGIiPKGTYGSHGLEIVMLSFHGRRARGTKITG 

DPNIPAGQQTVEIDLRHRIQLPDI.ENQRNFNBLSRIVLBVRERV 

RQEQQEGGHEAGEGRGRQGPRESQPSPAQPRAEAPSKGPDGTPG 

EDGGEPGDAVAAAEQPAQCGQGQPPVLPVGVSSRNEDYPRTCRM 
CFYGTGLIAGHGFTSPERTPGVPTT.T?nT?not?r'T?TTwr e-r Tr<^-nr.v 

SRVQATFRNADAPSPQAFDEMLKNIQSLTS 


6451 


848 


\ 269 


tvPAPRTVSGKRSLPGEWEERGEGEQRTGREFSGNGGRAVEAAR 
MRLLCGLWLWI^SLLKVIaQAQTPTPLPLPPPMQSFQGNQFQGEWF 

VLOIAGNSPRPEHRALUTAFTATFELSDDGRFEVWNAMTRGQHC 
DTWS YVIilPAAQPGQFTVDHRVWTHEQAGRPnnnpannt7T n c? 


6452 ' 


232 




HSi-TPPTSPRASTMEDVKLEFPSLPQCKEDAEEWTYPMRREMQE 

ILPGLFLGpysSAMRSKLPVLQICHGITHlICIRQlTrEANFIKPN 

PQQLFRYLVLDIADNPVENIIRFPPMTKEFIDGSLQMGGKVLVH 

GNAGISRSAAFVIAYIMETFGMKYRDAFAYVQERRFCINPNAGF 

VHQLQEYBAIYLAKLTIQMMSPLQIERSLSVHSGTTGSLKRTHE 
EEDDFGTMQVATAQNG ^^^i^ rtiL 


6453 " 


1 


652 


KUKGBSSNWEPXJUVYPLKdSd^PRAKVFAVLLSIVLCTVTLFLLQ - 

LKFLKPKIWSFYAFEVKnAKGRTVSLEKYKGKVSLWNVASDCO 

LTDRNYLGLKELHKEFGPSHPSVLAFPCNQFGESEPRPSKEVES 

PARKNYGVTFPIPHKIKILGSEGEPAFRPLVDSSKKEPRWNFWK 

YLVNPEGQWKFWRPEEPIEVIRPDIAALVRQVIIKiOCEDL 


6454 


827 


223 


HKKWX.PGLSMSPRRTLPRPLSLCI.SLCLCLCLAAALGSAQSGS6 

RDKKNCKWFSQQELRKRLTPLQYHVTQEKGTESAFEGEYTHHK 

DPGIYKCWCGTPLFKSETKFDSGSGWPSFHDVINSEAITPTDD 
PSYGMHRVETSCSOCGAHIiGHr PDnapPDWifovr-TKro n^T rir,« 

PADSSGTAEGGSGVASPAOADKAEL 


6455 


827 


223 


tiKKWl.PGI,SMSPRRTLPRPLSLCLSLCI.CLCLAAAJUGSAQSGSC 
RDKKNCKWFSQQELRKRLTPLQYHVTQEKGTESAFEGEYTHHK 
DPGIYKCWCGTPLFKSETKFDSGSGWPSFHDVINSEAITPTDD 
FSYGMHRVETSCSQCGAHLGHIFDDGPRPTGKRYCINSAALSFT 
PADSSGTAEG6SGVASPAQADKAEL 




1042 


173 I 

J 

< 

I 
\ 


<VHUATVSASAAWDALGLPVRSHMQGSTRRM6VMTDVHRRFLQL 
.MTHGVLEBWDVKRLQrHCYKVHDRNATVDKLEDFIMWIWSVLE 
SliYIBIKRGVTBDDGRPIYALVWXATTSISKMATDFAENELDLF 
IKALEIiI IDSETGFASSTNILWLVDC2LKGKKMRKKEAEQVLQKF 
rONKWLIEKEGEFTLHGRAILEMEQYIRETYPDAVKlCNICHSL 
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SEQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corr e sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
c o rr e spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine/ C«Cysteine, D=:Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G-GIycine, 
H=Histidine, I=Isoleucine, K==Lysine, 
L-Leucine, Ms=Methionine, N=Asparagine ^ 
P=Proline, Q=Glutamine, R=Arginine, 
S^Serine, T=Threonine, V=Valine, 
W»Tryptophan, YaTyrosine, X=Unknown* *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LIQGQSCETCGIRMHLPCVAKYFQSNAEPRCPHCNDYWPHEIPK 


6456 


2 


555 


RPQSRSISMWRNSLLQVSSGLRWLRVCAMVDIIiGERHLVTCKGA 
TVEAEAALQNKWAliYFAAARCAPSRDFTPLLCDPYTALVAETUl 
RPAP FEWPVSADGSSQEMLDPMRELHGAWLALPFHD P YRHEI*R 
KRYNVTAIPKLVIVKQNGEVITNKGRKQIRERGLACFQDWVEAA 
DIFQNFSV 


6457 


23 


892 


PTTGFPVTNPPWNWPDGKPPIMILYVSKLNKIIHPPDFDKKIPV 
KLFPLPLLYVGNHISGLSSTSKLSLPMFTVLRKFTIPLTLIjIjET 
1 1 LGKQYSLNI ILSVFAIILGAFIAAGSDLAFNLEGY I FVFiiND 
I FTAANGVYTKQKMDPKELGKYGVLFYNACFMI I PTLI ISVSTG 
DLQQATEFNQWKNWFILQFLLSCFLGFLLMYSTVLCSYYNSAL 
TTAWGAI KNVSVAYIGI LIGGD YI FSLLNFVGLNI CMAGGLRY 
SFLTLSSQLKPKPVGEENICIjDLKS 


6458 


23 


892 


PTTGFPVTNFPWNWPDGKPPIMILYVSKLNKIIHFPDFDKKIPV 
KLFPLPIiLYVGNHISGLSSTS KLS LPMFTVIiRKPTlPLTLLLET 
IILGKQYSLNI ILSVFAI rLGAPlAAGSDLAPNLEGYI FVFLND 
I FTAANGVYTKQKMDPKELGKYGVLFYNACFMI I PTLI I S VSTG 
DLQOATEFNQWKNVVFILQFLiSCFIiGFLLMYSTVLCSYYNSAL 
TTAWGAIKNVS VAYIG 1 LIGGDYI FSLLNFVGLNI CMAGGLRY 
SFLTLSSQLKPKPVGBENICLDIiKS 


6459 


23 


892 


PTTGFPVTWFPWNWPDGKPPIMILYVSKlJjJKlIHFPDFDKKIPV 
KLFPLPLLYVGNHISGLSSTSKLSLPMFTVLRKFTIPLTLLLET 
IILGKQYSLNI ILSVFAI ILGAFIAAGS DLAFNLEGYl PVPLND 
IPTAANGVYTKQKMDPKELGKYGVLPYNACPMIiPrLI I SVSTG 
DLQQATEFNQWKWWFILQFLLSCFLQFLLMYSTVLCSYYWSAL 
TTAWGAIKNVSVAYIGIL IGGDYI FSLLNFVGLNIC3W1AGGLRY 
SFLTLSSQLKPKPVGEBNICLDLKS 


6460 


23 


892 


PTTGFP VTNFPWNWPDGKP P IMI LYVSKLNKI IHFPDFi;>iCKi PV 
KLFPLPLLYVGNH ISGLSSTSKLSLPMFTVLRKFTI PLTLLLET 
IILGKQYSLNI ILSVFAI ILGAPIAAGSDLAFNLEGYI PVPLND 
I FTAANGVYTKQKMDPKELGKYOVLFYNACPMI I PTLI ISVSTG 

TTAWGAI KNVSVAYIGILIGGDYIFSLLNFVGLNICMAGGLRY 
SFLTLSSQLKPKPVGEBNICLDLKS 


6461 


1653 


360 


LQQRTLRITAVGQTHPIAWMAWEPSLGAFYGPASFITFVNCMYF 
LSIFIQLKRHPERKYELKEPTEEQQRLAANENGEINHQDSMSLS 
LI STSALENEHTFHSQLLGAS LTLLLYVALWMPGALAVSLYYPL 
L/Livcac vrLrAloJUoc orlr JP V vflHL.VWKnJJvKJjAWXMTCtJi'GRSS 
YS VQVNVQPPNSNGTNGEAPKCPNSSAESSCTNKSASSFKNS S Q 
GCKLTNLQAAAAQCHANSLPLNSTPQLDNSLTEHSMDNDIKMHV 

GSVQNGLPKSRLGNNEGHSRSRRAYLAYRERQYNPPQQDSSDAC 
STLPKSSRNFEKPVSTTSKKDALRKPAWELENQQKSYGLNLAI 
QMGPIKSNGQEGPLLGTDSTGNVRTGLWKHETTV 


6462 


3 


773 


SEELDREKKLKBDS PRKTPNKESGVPSLPVSLTS I KEEPKEAKH 
PDSQSMEES KLKNDDRKTPVNWKDSRGTRVAVS SPMSQHQSY IQ 
YLHAYPYPQMYDPSHPAYRAVSPVLMHSYPGAYLSPGFHYPVYG 
KMSGREETE KVNTSPS VNTKTTTESKALDLLQQHANQYRSKS PA 
PVEKATAEREREAERERDRHS PPGQRHLHTHHHTHVGMGYPL I P 
GQYDPFQGLTSAALVASQQVAAQASASGMFPGQRRE 


6463 


2 


350 


VILCI LGGWI FKNADRSMEKKKGEPRTRAEARPWVDEDLKDS SD 
LHQAEEDADEWQESEENVEHIPPSHNHYPEKEMVKRSQEFYELL 
NKRRSVRFISNEQVPMEVIDNVIRTAGL 


6464 " 


12 


1154 


GI LRQKEREERNR IHKKEI LFIiBHLLWPSEMSS LSGKVQTVLG 



507 



wo 01/53312 



PCT/USOO/34263 



SEQ " 
ID 
NO: 


I Predicted 
beginning- 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cy3teine, D=.Aspartic Acid, E= 
Glutaraic Acid, P=Phenylalanine, G=Glycine, 
H=!Histidine, I*Isoleucine, K= Lysine, 
Ii=5Leucine, M^Methionine, N=Asparagine, 
P=Proline, Q=61ut amine, R=Arginine, 
S=Serine, T=Threonine, V^Valine, 
WsTryptophan, Y=Tyrosine, X«Unknown, *=Stop 
Codon, /«possible nucleotide deletion, 
\=spo3sible nucleotide insertion) 








LVEPSKLGRTLTHBHIiAMTFDCCyCPPPPCQSAISKEPIVMKNL 
YW XQKNAYSHKENIiQLNQETEAI KEELLYFKANGGGALVENTTT 
GISRDTQTLKRLAEETGVHIISGAGFYVDATHSSETRAMSVEQL 
TDVLMNEILHGADGTS IKCGI IGE IGCSWPLTESERKVLQATAH 
AOAQLGCPVIIHPGRSSRAPFQIIRILQKAGADISKTVMSHr,DR 
TI LDKKELLEFAQLGC YLEYDLFGTELLHYQLGPDI DMPDDNKR 
IRRVRnLVEEGCEDRILVAHDIHTKTRLMKYGGHGYSHIIjTNVV 
PKMLLRG I TENVLDK I L I ENPKQWLTFK 


6465 


126 


1356 


KMTVFFKTLRNHWKKTTAGIiCliLTWGGHWLYGKHCDjrLLRRAAC 
QEAQVFGNQLIPPNAQVKKATVFUJJPAACKGKARTLFEKNAAPI 
IrHLSGMDVT I VKTDYEGOAKKLLELMENTDVI I VAGGDGTLQEV 
VTGVLRRTDEATFSKIPIGFIPLGETSSLSHTLFAESGNKVQHI 
TDATLAIVKGETVPLDVLQIKGEKEQPVFAMTGLRWGSFRDAGV 
KVSKYWYLEPLKIKAAHPPSTLKEWPQTHQASrSYTGPTERPPN 
EPEETPVQRPSLYRRILRRIASYWAQPQDALSQEVSPEVWKDVQ 
LSTIEIiSITTRNNQIiDPTSKEDFLNICIEPDTISKGDFITIGSR 
KVHNPKLHVEGTECLQASQCTLL I PEGAGGS FS IDSEB YEAMP V 
EVKIiLPRKLQFFCDPRKREQMLTSPTQ 




1134 


828 


VARGTELSQLEKAHPPADMGRRKSKRKPPPKKKMTGTLETQFTC 

PPCNHEKSCDVKMDRAENTGVISCTVCLEEFQTPITYLSEPVDV 
YSDWIDACEAANQ 


6467 


301 


2571 


Q5ELRVIAI*AHGELACHAVLTASLLSLRSRLMDSDMDYERPNVET 

IKCVWGDKAVGKTRLICARACWATLTQYQLLATHVPTWAXDQ 

YRVCQEVLERSRDWDDVSVSLRLWDTPGDHHKDRRFAYGRSDV 

WLCPSIANPNSLHHVKTMWYPEIKHFCPRAPVILVGCQLDLRY 

ADLEAVNRARRPIARPIKPNEILPPBKGRBVAKELGIPYYETSV 

v*v^e v»xisj^ViriJj>tAXKAfUjIfaRRHLQFWKSHLRNVQRPt^ 

PPKP PPP 1 1 WPDPPSSSEECPAHLLEDPLCADVILVLQERVRI 

PAHKIYLSTSSSKFYDLFLMDLSBGELGGPSEPGGTHPEDHQGH 

SDQHHHHHHHHHGRDFLLRAASFDVCBSVDEAGGSGPAGLRAST 

SDGILRGNGTGYLPGRGRVLSSWSRAFVSIQEEMAEDPLTYKSR 

LMWVKMDSSIQPGPFRAVLKYLYTGELDENERDLMHIAHIAEL 

LEVFDLRMMVANI LNNEAFMNQEITKAPHVRRTNRVKECIAKGT 

FSDVTFILDDGTISAHKPLIiISSCDWMAAMFGGPPVESSTREW 

FPyTSKSCMRAVLEVI,YTGMFTSSPDI.DDMKDlILANRLCLPHr, 

VALTEQYTVTGLMEATQMMVDIDGDVIiVFriEIiAQFHCAYQIiADW 

CLHHICTimiNVCRKPPRDMKAMSPENQBYFEKHRWPPVWYLKE 

EDHYQRARKEREKEDYLHLKRQPKRRWLFWNSPSSPSSSAASSS 
SPSSSSAW 


6468 


3 


1374 


UAWAGTNMAALAPVGSPASRGPRLAAGLRLLPMLGI.LQLIAEPG 
LGRVHHIJUl^KDDVRHKVHLNTFGFFKDGYMVVNVSSLSI^EPED 
KDVTIGFSUDRTKNDGFSSYLDBDVNYCILKKQSVSVTLLILDI 
SRSBVRVKSPPEAGTQLPKIIFSRDEKVI.GQSQBPNVNPASAGN 
QTQKTQDGQKSKRSTVDSKAMGEKSFSVHNNGGAVSFQFFPNIS 
TDDQBGLYSL YFHKCLGKELPSDKFTFSLDIE ITB KNPDS YLSA 
GEIPLPKLYISMAFFFFLSGTIWIHILRKRRNDVFKIHWLMAAL 
PPTKSLSLVFHAIDYHYISSQGFPIEGWAWYYITHIiLKGAliLF 
ITIALIGTGWAFIKHILSDKDKKIFMIVlPRRVIiANVAYIlIBS 
TEEGTTEYGLWKDSLPLVDLLCCGAILPPWWSIRHLQEASATD 
GKGKFSRAHFVLIiSLL 


6469 


3 


1374 


DAWAGTNMAALAPVGSPASRGPRIAAGLRliliPMLGLLQLLAEPG^ 

LGRVHHLALKDDVRHK^THIJn'FGFFKDGYMVVNVSSLSLNEPED 

KDVTIGFSLDRTKNDGFSSYLDEDVNYCILKKQSVSVTLLILDI 

SRSEVRVKSPPEAGTQLPKIIFSRDEKVLGQSQEPNVNPASAGN 

QTQKTQDGGKSKRSTVDSKAMGEKSFSVHNNGGAVSPQFFFNIS 

rDDQBGLYSLYFHKCLGKELPSDKFTPSLDlEITEKNPDSYLSA 
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SEQ 
ID 
NO! 


Predicted 
beginning 
nucleotide 
location 
corresponding- 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acxd segment containing signal peptide 
(A=Alanine, C=CyBteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G==Glycine, 
H=Hi3tidine, I-Isoleucine, K= Lysine. 
L=Ijeucine, M=Methionine, N-Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, Ts=Threonine , V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=iStop 
Codon, /==possible nucleotide deletion, 
\=possible nucleotide insertion) 








GEIPLPKLYiSMAFFFFLSGTlWIHlLRKRRNDVFKIHWLMAAL 
PPTKSLSLVFHAIDYHYISSQGFPIEGWAWYyiTHLLKGALLF 
ITIALlGTGWAPIKHILSDKDKKIFMIVIPRRVLANVAYrilES 
TEEGTTEYGIjWKDSLFLVDLLCCGAILFPWWSIRHLQEASATD 
GKGiCFSRAHFVLIiSLL 


6470 
S471 


2726 


1437 


AAASGVSSKADAPVIAQSPASAGNGRPSTPRVPGSRRHPSAPRS 
GPLPREDQGRTPGPQLIiPLPGALIiRPRTLLSSAAETGRSRHPDT 
QHPSSGGRCRGGTESPSSAAGRPASMAEAEEDCHSDTVRADDDE 
BWESPAETDIiQAQLQMFRAQWWFELAPGVSSSWLEKTRPCRAARG 
SLOKTSADTKGKQEQAKEEKARELPLKAVEEEQNGALYEAIKPY 
RRAMQLVPDIEPKITYTRSPDGDGVGNSYIEDNDDDSKMADLLS 
YFQQQLTFQESVLKLCQPELESSQIHISVLPMBVLMYIPRWWS 
SDLDLRSLEQLSLVCKGPYICARDPEIWRLACLKVWGRSCrKLV 
PYTSWREMFLERPRVRFDGVYISKTTYIRQGEQSIiDGFYRAWHQ 
VE YYRY IR FFPDGHVMMLTTPEEPQS I VPRLRTR 


6472 


1750 


299 


FFFDKiyiAAGGaGVGGKRSSKSDADSGFUSLRPTSVDPALRRRRR 
GPRNKKRGWRRLAQEPLGLEVDQPLEDVRLQERTSGGLLSEAPN 
EKLFPVDTGSKEKGLTKKRTKVQKKSLLLKKPLRVDLILENTSK 
VPAPKDVLMQVPNAKKLRRKEQLWEKLAKQGELPREVRRAQAR 
LLNPSATRAKPGPQDTVERPFYDLWASDNPLDRPLVGQDEFFLE 
QTKKKGVKRPARLHTKPSQAPAVEVAPAQASYNPSFEDHQTLLS 
AAHEVELQRQKEAEKLERQLAIiPATEQAATQESTFQELCEGLLE 
ESDGEGEPGQGEGPEAGDAEVCPTPARLATTEKKTEQQRRREKA 
VHRLRVQQAALRAARLRHQELFRLRGIKAQVALRLAELARRQRR 
RQARREAEADKPRRLGRLKYQAP0IDVQLSSELTDSLRTLKPEG 
NILRDRFKSFQRRMMIEPRBRAKFKRKYKVKr.VBKRAPRBlQL 




3 


897 


SCGSDRAQWAMEFPFDVDALFPERITVLDQHLRPPARRPGTTTP 
ARVDLQQQIMTI IDELGKASAKAQNLSAPITSASRMQSNRHWY 
ILKDSSARPAGKGAIIGFIKVGYKKliFVLDDREAHNEVEPLCrL 
DFYIHESVQRHGHGRELFQYMLQKERVEPHQIiAXDRPSQKLLKF 
I^KHYNLETTVPQVNNFVlFEGFFAHQHRPPAPSbRATRHSRAA 
AVDPTPAAPARKljPPKRAEGDrKPYSSSDREPLKVAVEPPWPLN 
RAPRRATPPAHPPPRSSSLGWSPERGPLRPFVP 


64 73 
6474 


22 


912 


SSAVEFVWEGEKMAAEPKKTEIQTLFKRLRAVPTNKACPDCGAK 
NPSWASITYGVPLCIDCSGVHRSLGVHLSPIRSTELDSNWNWFQ 
LRCMQVGGNANATAFFRQHGCTANDANTKYNSRAAQMYREKIRQ 
LGSAALARHGTDLWIDNMSSAVPNHSPEKKDSDFFTEHTQPPAW 
DAPATEPSGTQQPAPSTESSGLAQPEHGPNTDLLGTSPKASLEL 
KSSIIGKKKPAAAKKGLGAKKGLGAQKVSSQSFSEIERQAQVAE 
KLREQQiAADAKKQAEESMVASMRLAYQBI/QIDR 




3 


452 


i^QRQRQHPAAAPAVPVRCFTFCFTDIVIMPKRKSPENTEGKDGS 
KVTKQEPTRRSARLSAKPAPPKPEPKPRKTSAKKEPGAKISRGA 
KGKKEEKQEAGKEGTAPSENGETKABEIHISRSTVNVSTSRGTP 
PSTLSVKGQIETVRVKGTEN 


6475 
6476 ■ " 


3 


462 ~ 


LQRQRQHPAAAPAVPVRCPTFCFTDIVIMPKRKSPENTEGKDGS 
KVTKQEPTRRSARLSAKPAPPKPBPKPRKTSAKKEPGAKISRGA 
KGKKEEKQEAGKEGTAPSENGETKAEEIHISRSTVNVSTSRGTP 
PSTLSVKGQIETVRVKGTEW 




106 


1090 

1 


ARAMAQYKGTMREAQRAMHLLKKRBRQREQMEVIiKQRIAEETII, 
KSQVDKRFSAHYDAVEAELKSSTVGLVTLNDMKARQEALVRERE 
RQLAKRQHLEEQRLQQSRQREQEQRRERKRKISCLSFALDDLDD 
OADAAEARRAGNIXSKWPDVDTSFLPDRDREEEENRLREELRQEW 
BAQREKVKDEEMEVTFSYWDGSGHRRTVRVRKGNTVQQFLKECAL 
BGLRKDFLELRSAGVEQLMFIKEDLILPHYHTPYDFIIARARGK 
SGPLFSFDVHDDVRLLSDATMEKDESHAGKWLRSWYEKNKHIF 
PASRWBAYDPEKKWDKYTJtR 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

cor re spond i ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 

nucleotide 

location 

to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
[A=Alanine, C«Cyateine, D«Aspartic Acidi S= 
Glutamic Acid, P= Phenyl alanine, G=Glycine, 
H:=Hlstidinef l^Isoleucine^ K^Lysine, 
L=Leucine, M-Methionine, N-Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V^Valine, 
WsTryptophan, Y^T'yrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=poseible nucleotide insertion) 


6477 


227 


915 


LQGH1.MGIMAASRPLSRFWEWGKWIVCVGRNYADHVREMRSAVL 
SE PVLPLKPSTAYAPEGSPILM PAYTRNLHHELELGWMGKRCR 
AVPEAAAMDYVGGYALCIiDMTARDVQDECKXKGLPWTLAKS FTA 
SCP VS AFVPKEKI PDPHKLKLWLKVNGELRQEGETSSM I FS IPY 
IISrVSKI ITLEEGDI ILTGTPKGVGPVKENDEIEAGIHGLVSM 

T PIC VR IfD RV 


6476 


2 


1495 


FVSSRILPESLASSBASTLEAKGRKEEDDCSSWKKQTTNIRKTF 

IFMEVIiGSGAFSEVFLVKQRLTGKIjFALKCIKKSPAFRDSSLEN 

BIAVLKKIKHENIVTLEDIYESTTHYYLVMQLVSGGELFDRILE 

RGVYTEKDASLVIQQVLSAVKYLHENGXVHRDLKPENLLYLTPE 

ENSKIMITDFGLSKMBQNGIMSTACGTPGYVAPEVrAQKPYSKA 

VDCWS IGVl TY I LLCGYPPFYEETES KLFEKI KEQYYEFESPFW 

DDISESAKDFICHLLEKDPNERYTCEKALSHPWIDGNTALHRDI 

J esfvh JjQ 1 UKCJ r AKS ECWRQAFN AAAvVHHMRKXjHMNLHS PG VRP 

EVENRPPETQASETSRPSSPEITITEAPVLDHSVALPALTQLPC 

QHGRRPrAPGGRSLKCLVNGSIiHISSSLVPMHQGSLAAGPCGCC 

o JjW i taO K(jK5 5 ICSE PT IjbKKANKKQNF 

HCRAGQTGVCLIM 


6479 


3 


949 


SCRGPGWHPAGGQAGAMELLSALSLGELALSFSRVPLFPVFDLS 
YFIVSILYLKYEPGAVELSRRIIPIASWLCAMLHCFGSYIIiADLL 
LGEPLIDYFSNWSSILIASAVWYLIFFCPLDLFYKCVCFLPVKL 
I FVAMKEWRVRKI AV6IHHAHHHYHHGWFVM I ATGWVKGSGVA 
LMSNFEQLLRGVMKPETNEILHMSFPTKASLYGAILFTLQQTRW 
LPVSKASLIFIFTLFiWVSCKVPLTATHSHSSPPDALEGYICPVL 
FGSACGGDHHHDNHGGSHSGGGPGAQHSAMPAKSKEELSEGSRK 
KKAKKAD 


6480 


192 


514 


DFMSIYFPIHCPDYLRSAKMTEVMMNTQPMBEIGLSPRKDGLSY 
QIFPDPSDFDRCCKIjKDRLPSIWEPTEGEVESGELRWPPEEFL 
VQEDEQDNCBETAKEWKEQ 


6481 


110 


1131 


KSRMDLDWNMFVIAGGTLAI PILAFVASFLIiWPSALIRI YYWY 
WRRTLGMQVRYVHHEDYQFCYS FRGRPGHKPS 3 LMLHGFSAHKD 

mwiiswkflipkwlhlvcvdmpghegttrsslddlsidgqvkrih 
qfveclklnickpfhlvgtsmggqvagvyaayypsdvsslwlvcp 
aglqystdnqfvqrlkelqgsaavekiplipstpeemsemlqlc 
s yvrfkvpqq iiiqgiivdvr 1 phnnfyrklfbe ivseksryslhq 
nmdkikvptqi iwgkqdqvldvsgadmiiaks lancqvellencg 
hswmerprktakli idflasvhntdnnkkld 


6482 


2517 


568 


EPVSKVSQSRRKAGVPTANIEESQAVEAAMANVPWAEVCEkFQA 
ALALSRVELHKNPEKEPYKSKYSARALLEEVKALLGPAPEDEDE 

NHIDTEELSAGEEHLVKCLRLLRRYRLSHDCISLCIQAQNNLGl 
LWSEREE I ETAQAYLESSEAliYNQ YMKEVGSPPLDPTERFLPEE 
EKliTEQERS KRFEKVYTHNLYYLAQVYQHLEMFEKAAHYCHSTL 
KRQLEHNAYHPIBWAINAATLSQFYINKLCFMSARHCLSAANVI 
FGQTGKISATEDTPEAEGE VPELYHQRKGE lARCW IKYCLTLMQ 
NAQLSMQDNIGELDLDKQSELRALRKKELDEEESIRKKAVQFGT 
GELCDAISAVEEKVSYLRPIJDFEEARELFLLGQHYVFEAKEFFQ 
IDGYVTDHIEWQDHSALFKGLAFFETDMERRCKMHKRRIAMLB 
PLTVDUTPQYYLLVNRQIQFEIAHAYYDMMDLKVAIADRLRDPD 
SHIVKKlNWLNKSALKYYQLFLDSIiRDPKKVFPEHIGEDVriRPA 
MXAKFRVARLYGKIITADPKKEIiENLATSLKHYKFIVDYCEKHP 
KAAQElEVELEIiSKEMVSLLPTKMERFRTKMALT 


6483 


3 


623 


NSHLLCGLRARAPLSANGREARAMBQRLAEFRAARKRAGLAAQP 
PAASQGAQTPGEKAEAAATLKA7U>GWLKRFLVWKPRPASARAQP 
GLVQEAAQPQGSTSETPWNTAIPLPSCWDQSFL'miTFLKVIjLW 
LVLLGLFVELEFGLiAYFVLSIiPYWMYVGTRGPEEKKEGEKSAyS 
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SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 

*J iTxr e s p o n cmig 

to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide ' 
(At^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, Phenylalanine, G==Glycine, 
H==Histidine, I=Isoleucine, K=Lysine, 
L=:Leucine, MsMethionine, N=Asparagine , 
P= Proline, a=Glut amine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W^Tryptophan, Yr^Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








VFWPGCEAIQGTLTAEQLERELQLRPLAGR 


6484 


201 


965 


QLAVKTKMSGIiRPGTQVDPEIELFVKAGSDGESIGNCPFCQRLF 
MILWLKGVKFNVTTVDMTRKPEELKDLAPGTWPPFLVYNKELKT 
DFIKIEEFLEQTLAPPRYPHLSPKYKESPDVGCNLFAKFSAYIK 
NTQKBANKNPEKSLLKEFKRLDDYLNTPXjIiBEIDPDSAEEPPVS 
RRLPLDGDQLriiADCSLLPKIiNIIKVAAKKYRDFDIPAEFSGVW 
RYLHNAYAREEFTHTCPEDKE I ENTYANVAKQKS 


6485 




1091 


FVDLVRAVEFLPCPDSQKLEKECQSSEESMGSNSMRS I LEEDEE 
DEEPPRVLLYHEPRSFEVGMLVWHKHKKYPFWPAWKSVRQRDK 
KAS VLYI EGHMNPKMKGFTVS LKSLKHFDCKEKQTLLNQAREDF 
NQDIGWCVSIiI TDYRVRLGCXSSFAGS PLE YYAADI S YPVRKS IQ 
QDVI/3TKLPQLSKGSPEEPWGCPLGQRQPCRKMLPDRSRAARD 
RANQiaVEYIGKAKGAESHIiRAILKSRKPSRWLQTFLSSSQYVT 
CVETYLEDEGQLDLWKYLQGVYQEVGAKVLQRTNGDRIRFILD 
VLLPSAI ICAISAGDEVDYiCTAEEKYIKGPSIiSYREKE 1 PDNQL 
LEERNRRRR 


6486 


10 


581 


LVLQAGGAHLS P SRVTQG I YYMLAFSEMPKPPDYSELSDSLTLA 
GGTGRPSGPLHRAWRMMNFRQRMGWIGVGLYLIASAAAFYYVFE 
ISETYNRLALEHXQQHPEEPLEGTTWTHSLKAQLIiSLPFWVWTV 
IFLVPYLQMFLFLYSCTRADPKTVGYCIIPICliAVICNRHQAFV 
KASNQISRLQLIDT 


S487 


352 


863 


SFLKPLRGKMSVTLHTDVGDIKIEVFCERTPKTCENFLALCASt^ 
YYNGCI FHRNIKGFMVQTGDPTGTGRGGNS IWGKKFEDE YSBYL 
KHNVRGWSMANNGPNTNGSQFFI TYGKQPHLDMKYTVPGKVID 
GLETLDELEKLPVNEKTYRPLNDVHI KDITIHANPPAQ 


6488 


878 


241 


TALQEFGTSGPPLSLRFALPSGTORPKPLFGARGPSWPPSPRVP " 
MEPPNIiYPVKLYVYDLSKGliARRLSP IMLGKQLEGIWHTS I WH 
KDEFPF6SGGI SSCPPGGTLLQPPDSWDVGSTEVTEEI FLEYL 
SSLGESLFRGEAYNLFEHNCNTFSNE VAQFIjTGRKI PS Y ITDLP 
SEVLSTPFGQAIJiPLLDS I Q XQP PGGSS VGRPWGQS 


64B9 


1457 


375 


KVAKMATALSEEELDNEDYYSLLNVRREASSEBLKAAYRRLCML 
YHPDKHRDPELKSQAERLFNLVHQAYEVLSDPQTRAIYDIYGKR 
GLEMEGWEWERRRTPAEIREEFERLQREREERRLQQRTNPKGT 
ISVGVDATDIiFDRYDEEYEDVSGSS FPQI EINKMHI SQS I EAPL 
TATDTAILSG3LSTQNGNGGGSINFALRRVTSAKGWGELEFGAG 
DLQGPXjFGLKLFRNLTPRCFVTTNCALQFSSRG I RPGLTTVIiAR 
NLDKNTVG YLQWHCSSPLLQVQRPHRNTRACAPE PSFRPPLHVP 
TWUAECSGARTPSrAWTSAAVKLREACLSGPGSGSHQLLLLTPR 
SKRRTGGG 


6490 


3 


1183 


heagcevwlgygpraaaaaaatvlfggagpretmfvars laajdh 
kdlihdvsfdfhgrrmatcssdqsvkvwdksesgdwhctaswkt 
hsgs vwrvtmahpepgqvlas csfdrtaavwee i vgesitoklrg 
qshwvkrttlvdsrtsvtdvkfapkhmgiimlatcsadgivriys 

APDVMNLSQWSLQHBISCKLSCSCISWNPSSSRAHSPMIAVGSD 
DSSPNAMAKVQ I FE YNEKTRKYAKAETLMTVTDPVHDIAPAPNL 
GRSFHI LAI ATKDVRIPTLKPVRKELTSSGGPTKFE IHI VAQFD 
NHNSQVWRVSWNITGTVLASSGDDGCVRLWKANYMDNWKCTGIL 
KGNGS PVNGSSQQGTSMPSLGSNIPS LQNSIiNQSSAGRKHS 


6491 


3 


1183 


HBAGCE VWLGYGPRAAAAAAATVLFGGAGPTETMFVARS lAADH 
KDLIHDVS FDFHGRRMATCS SDQSVKVWDKSESGDWHCTAS WKT 
HSGSVWRVTWAHPEFGQVLAS CS FDRTAAVWEEI VGESNDKLRG 
QSHWVKRTTIiVDSRTSVTDVKFAPKHMGIoMLATCSADGIVRIYB 
APDVMNLSQWSLQHEISCKL S CSCISWNPSS SRAHSPMI AVGSD 
DSSPNAMAKVQIFEYNENTRKYAKAETLMTVTDPVHDXAFAPNL 
ORSFHIIAIATKDVRI PTLKPVRKELTSS GGPTKFE IHI VAQFD 
NHNSQWRVSWNITGTVLASSGDDGCVRLWKANYMDNWKCTGI L 
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SEQ 
ID 

mt 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A«Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanlne, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
Ii^Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, VoValine, 
W^Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /'=possible nucleotide deletion, 
\=pQ3sible nucleotide insertion) 


6492 
6493 


34 


2573 


KGNGSPVNGSSQQGTSKPSLGSNIPSLaNSLNGSSAGfe'ialS 

IFFl.KSCCCCCI.FDFPPPPl.DQVQEEECEVERVtEHGTPKPPRK 

FDSVAFGESQSEDEQPENDLETDPPNWQQLVSREVULGLKPCEI 

KRQEVlNELFYTERAHVRTLKVLDQVFyQRVSREGILSPSEtiRK 

IFSNLEDILOiHIGLNEQMKAVJRKPjJETSVIDQIGEDLLTWFSG 

PGEEKLKHAAATPCSNQPFALEMIKSRQKKDSRFQTFVQDAESN 

PliCRRLQLKDIIPTQMQRLTKYPLLLD^TIATYTEWPTEREKVKK 

AADHCRQILNYVNQAVKEAENKQRLEDYQRRLDTSSLKLSEYPN 

VEELRNLDLTKRKMIHEGPLVWKVNRDKTIDLYTLLLKDILVLL 

QKQDDRLVLRCHSKIIASTADSKHTFSPVIKLSTVLVRQVATDN 

KALFVISMSDNGAQIYELVAQTVSEKTVWQDLICRMAASVXEQS 

TKPIPLPQSTPGEGDNDEEDPSKLKEEQHGISVTGLQSPDRDLG 

LESTLlSSKPQSHSLSTSGKSEVRDliFVAERQFAKSQHTDGTLK 

EVGEDYQIAIPDSHLPVSEERWALDALRNIiGLLKQLLVQQLGLT 

EKSVQEDWQHFPRYRTASQGPQTDSVIQNSENIKAYHSGEGHMP 

FRTGTGOIArCYSPRTSTBSFAPRDSVGLAPQDSQASNILVMDH 

MIMTPEMPTMEPEGGLDDSGEHFFDAREAHSDENPSEGDGAVNK 

EEKDVNLRISGNYLILDGYDPVQESSTDEEVASSLTLQPMTGXP 

AVESTHQQQHSPQNTHSDGAISPPTPEFLVQQRWGAMEYSCPBI 

QSPSSCADSQSQIMBYIHKXEADLEHLKKVEESYTILCQRtAGS 
ALTD KHS DtCS 


S494 


557 


1147 


rPARMAYQGaSTSDCMSKTLDSASAHFAASAWSAPVPSRSEVA 
KEQNTGHNNINGWQPSGTSKTLYSTNMALSSSPGISAVQLVRT 
VGHTTTNHLI PALCTSSPQTLPMNNSCLTNAVHLNNVS WS P VN 

VHINTRTSAPSPTALKLATVAASMDRVPKVTPSSAISSIARENH 
EPERLGLNGIAETTVAMEVT 


6495 


242S 


1052 


AVAGGARPCSTPSSPHRRCRRHRPRPLPRPPAAIMSASAVYVLD 
r)KGKVLICRNYRGDVDMSBVEHFMPILMEKEEBG^lLSPlLAHGG 
VRFMWIKHNNLYLVATSKKNACVSLVFSFLYKWQVFSEYFKEL 
SEESIRDNFVIIYELLDELMDFGYPQTTDSKILQEYXTQEGHKL 
BTGAPRPPATVTNAVSWRSEGIKYRKNEVPLDVIBSVNLLVSAN 
GNVLRSEIVGSIKMRVFLSGMPELRLGLNDKVLFDNTGRGKSKS 
VELEDVKPHQCVRLSRPENDRTI S PI PPDGEFEI.MS YRXiNTHVK 
PLIWIES VIEKHSHSR I EYM IKAKSQFKRRSTANNVE IHIP VPN 
DADSPKFKTTVGSVKWVPiSKSEIVWSIKSFPGGKEyLMRAHFGL 

PSVEAEDKEGKPPISVKFEIPYFITSGIQVRYLKIIEKSGYQAL 
PWVRYITQNGDYQLRTQ 


6496 


2425 


1052 


AVAGGARPCSTPSSPHRRCRRKRPRPLPRPPAAIKSASA\>V\/LD 

LKGKVLI CRWYRGDVDMSEVEHPMPlLMEKEEEGKIiS P I LAHGG 

VRFMWIKHNNLYLVATSKKNACVSLVPSFLYKWQVPSBYFKEL 

EEESIRDNPVIIYELLDEU^DFGYPQTTDSKILQEYITQEGHKI, 

ETGAPRPPATVTWAVSWRSEGIKYRia?EVPIiDVlESVNLI,VSAN 

GNVLRSEIVGSIKMRVFLSGMPBLRLGLNDKVLFDNTGRGKSKS 

VELEDVKFHQCVRLSRFENDRTISPIPPDGEFELMSYRLNTHVK 

PLIWIESVIEECHSHSRIEYMIKAKSQFKRRSTANNVEIHIPVPN 

DADSPKFKTTVGSVKWVPENSEIVWSIKSPPGGKEYLMRAHFGL 

PSVEAEDKEGKPPISVKFEIPYPTTSGIQVRYLKIIEKSGYQAL 
PWVRYITQNGDYQLRTQ 


64 97 


347 


559 


Li?AVSLLPLQ]jVLPEySIHSLFCIMFLCAQEWLTLGLNVp"riLFY 
HFWRYFHCPADSSEIiAYDPPWI^ADTLSYCQKEAWCiCLAFYLL 
SFFYYLYCMIYTLVSS 




1053 


352 

( 

: 

2 


^VNTQICRLCPRRHLHPPCGAKMGNGTEEDYNFVFKVVLIGESGV" 

SKTKLLSRPTRKrBFSHDSRTTIGVEFSrRTVMLGTAAVKAQIWD 

rAGLERYRAITSAYYRGAVGALLVFDLTKHQTYAWERWLKELY 

DHAEATIWMLVGNKSDLSQARBVPTBEARMPAENNGLLPLETS 

\LDSTNVBIAFErVLKEIFAKVSK0RQNSIRTNAITLGSAOAGQ 
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SEO 
ID 
NO: 


Predicted ~ 
beginning 
nucleot ide 
location 
corr e spending 
to first 
amino acid 
residue of 
amino acid 
secfuence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Am3.no acid segment containing signal peptide 
(A-Alanine, C=CyBteine, D^sAspartic Acid, E= 
Glutamic Acid, F-Phenyl alanine, G=^Glycine, 
H=Histidine, I=Isoleucine, K=Iiysine, 
LsLeucine^ M=Methionine, N=^Asparagine , 
P=Prollne, Q=Glut amine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y«Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\spossible nucleotide insertion) 
BPGPGEKRACCISL 


6498 


2636 


2 72 


SLRLCPWOTHIiAGPTTMRLSSLlALIiRPALPLILGLSLGCSLSL 
LRVSWIQGHGEDPCVEAVGERGGPQUPDSRARLDQSDBDPKPRI 
VPYYRDPNKPYKKVLRTRYIQTELGSRERLLVAVLTSRATLSTL 
AVAVNRTVAHHFPRLLYPTGQRGARAPAGMQWSHGDERPAWLM 
SETLRHLHTHFGADYDWFPIMQDDTYVQAPRLAAIiAGHLSINQD 
LYLGRAEEFIGAGEQARYCHGGFGYLLSRSLIiLRIiRPHIiDGCRG 
DILSARPDEWLGRCL IDS LGVGCVSQHQGQQYRSFELAKNRDPE 
KEGSS AFLSAFAVHP VSEGTLMYRLHKRFSAI>ELERAYS E I EQL 
QAQIRNLTVLTPEGEAGLSWPVGliPAPFTPHSRFEVLGWDYFTE 
QHTFSCADGAPKCPLQGASRADVGDALETALEQLNRRYQPRUeP 
QKQRLLNGYRRFDPARGMEYTLDLLLECVTQRGHRRALARRVSL 
LRPLSRVPILPMPYVTEATRVQLVLPLLVAEAAAAPAFLEAFAA 
NVLHPREHALLTLLLVYGPREGGRGAPDPFLGVKAAAAELERRY 
PGTRLAWLAVRAEAPSQVRLMDWS KKHPVDTLFFLTTVWTRPG 
PEVLm^CRMNAISGMQAPFPVHFQBFNPArjSPQRSPPGPPGAGP 
DPPSPPGADPSRGAPIGGRFDRQASAEGCFYNADYLAAHARLAG 
BLAGQBEEEALEGLEVMDVFLRPSGI,HLFRAVEPGLVQKFSL»RD 
CSPRLSEELYHRC3?LSNLEGLGGRAQI»AM7U,FEQEQAKST 


6499 


3 


2040 


SCSADTRPSGQAWPTVGLRAAAGAFRTGSPLALGPETPQVACLP 
GHPPVRPQVSGGPGAMPDPAAHLPFF YGS ISRAEAEEHIiKIAGM 
ADGLFLLRQCLRSLGGYVLSLVHDVRFHHPPIERQLNGTYAIAG 
GKAHCGPAELCEFYSRDPDGLPCNLRKPCNRPSGLEPOPGVFDC 
LRDAMVRDYVRQTWKLEGEAIiEOAIISQAPQVEKIilATTAHERM 
PWYHSSLrREEAERKLYSGAQTDGKFLLRPRKEQGTYALSLIYG 
KTVYHYLXSQDKAGKYCIPEGTKFDTLWQLVEYLKIiKADGLIYC 
LKEACPNSSASNASG;^PTr.PAHPSTLraPQRRIDTLNS DGYT 
PEPARITSPDKPRPMPMDTSVYESPYSDPEELKDKKLFLKRDKL 
LIADIELGCGNFGSVRQGVYRMRKKQIDVAIKVLKQGTEKADTB 
EMMREAQIMHQLDNPYIVRLIGVCQAEALMLVMEMAGGGPIjHKF 
LVGKREEXPVSNVABLLHQVSMGMKYLESKNPVHRDLAARNVLL 
VNRHYAKISDFGLSKALGADDSYYTARSAGKJ^Lia^S^APECINF 
RKFSSRSDVWSYGVTMWEALSYGQKPYKKMKGPEVHAFIEQGKR 
MECPPECPPELYALMSDCWIYKWEDRPDFLTVEQRMRACYYSLA 
S KVEGPPGSTQKAEAACA 


6500 


1773 


726 


TGPTHASADAWG1.VRSVTBWCANVRGWPCAAALSCPQAVI,DAGK 
MLSESSSPLKGVMLGSIFCALITMLGHIRIGHGNRMHHHEHHHIj 
QAPNKEDTLKISEDBRMELSKSPRVYCIILVKPKDVSLWAAVKE 
TWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDK 
YRDQYNWFPLARPTTFAlIENLKYFIiLKKDPSQPFYLGHTIKSG 
DLEYVGMBGGIVLSVSSMKRLNSLIiNIPEKCPEQGGMIWKISED 
KQLAVCLKYAGVFAENAEDADGKDVFKTTKS VGLS I KEAMTYHPN 
QWEGCCSDMAVTFNGLTPNQMHVMMYGVYRIiRAFGPYFQ 




6 SOI 


1 




iiVGMSGGGTETPVGGEAAPGGGSKKRDSLGTAGSAHLIIKDLGE 
IHSRLLDHRPvrQGErRyFVKBFEEBCRGLREKRVLENLKNMIHE 
TNEHTLPKCRDTMRDSLSQVLQRLQAANDSVCRLQQREQERKKI 
HSDHLVASEKQHMLQWDNFMKEQPNKRAEVDEEHRKAMBRLKEQ 
YAEMEKDLAKFSTF 




6502 


213 


1650 


AGNKPDPWAGRNRTAVLPDVSVPHREDVGWWRSWLQQSYQAVKB"" 
KSSEALEFMKRDLTEFTQWQHDTACTIAATASWKBKLATEGS 
SGATEKMKKGLSDFLGVTSDTFAPSPDKTIDCDVITLMGTPSG? 
AEPYDGTKARLYSLQSDPATYCISrEPDGPPELFDAWIiSQFCIliBEK 
KG2ISELLVGSPS IRALYTKMVPAAVSHSEFWHR YFYKVHQLEQ - 
EQARRDALKQRAEQS ISEEPGWEEEEEELMGIS PI S PKEAKVPV 
AKrSTFPEGEPGPQSPCEENLVTSVEPPAEVTPSESSESISIiVT 
QIANPATAPEARVLPKDLSQKLLBASLEEQGIiAVDVGETGPSPP 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acxd segment containing signal peptide" 
(A=Alanine, C^Cysteine, D=Aspartic Acid, £« 
Glutamic Acid, F=Plienylalanine, G=Glycine, 
HaHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, K=Methionine, N=Asparagine , 
P^Praline, Q^Glut amine, R=Arginine, 
SaSerine, T=Threonine, v=Valine, 
Ws=Tryptophan, y=,Tyrosine, X=Unknown, *-Stop 
Codon, /=pos Bible nucleotide deletion, 
\«possibIe nucleotide insertion) 








IHSKPLTPACJHTGQPEPRPPARVETIiREEAPTDLRVFfiLNSbSG 
KSrPSNNGKKGSSTDISEDWEKDFDIJ3MTEEEVQI^ALSKVnASG 
EVSGPGGSEGSEPWGPGCESSPQPAQLSPQEGPCSCLR 


6503 
6504 


213 


1650 


AGNKPDPWAGRNRTAVLPDVSVFHREDVGMWRSWLQQSYQAVKE 
KSSEALEFMKRDLTEPTQWQHDrACTIAATASWKEKLATEGS 
SGATEKMKKGLSDPLGVISDTFAPSPDKTIDCDVITLMGTPSGT 
AEPYDGTKARLYSLQSDPATYCNEPDGPPELFDAWLSQFCZjEEK 
KGEISBLLVGSPSIRALYTKMVPAAVSHSEFWHRYFYKVHQLEQ 
EQARRDALKQRAEQSISEEPGWEEEEBELMGISPISPKEAKVPV 
AKISTFEEGEPGPQSPCEKNLVTSVEPPAEVTPSESSESISLVT 
QIANPATAPEARVLPKDLSQKLLEASLEEQGIiAVDVGETGPSPP 
IHSKPLTPAGHTQGPEPRPPARVETLREEAPTDLRVFELNSDSG 
KSTPSNNGKKGSSTDISEDWEKDFDLDMTBEEVQMALSKVDASG 
SVSG?GGSEGSEPNGPGCESSPQPAQIiSPQEGPCSCI,R 




2131 


1294 


GKVC-^VAKW VCLS ILS PPPAGMKTPNAQEAEGQQTRAAAGRATG 
SANMTKKKVSQKKQRGRPSSQPCRNIVGCRISHGWKEGDEPITQ 
WKGTVLDQVPINPSLYLVKYDGIDCVYGLELHRDERVLSLKILS 
DRVASSHISDANLANTI IGKAVEHMFEGEHGSKDEWRGMVLAQA 
PIMKAWPYITYEKDPVLYMYQLLDDYKBGDLRIMPESSESPPTB 
REPGGWDGLIGKHVEYTKEDGSKRrGMVIHQVEAKPSVYPIKF 
DDDFH I YVYDL VKKS 


6505 


2131 


1294 


gkvclvahwvclsilspppagmktpnaqeabgqqtraaagratg" 

SANMTKKKVSQKXQRGRPSSOPCRWIVGCEISHGWKEGDEPITQ 

wkgtvldqvpinpslylvkydgidcvyglelhrdervlslkils 

DRVASSHISDAKI>ANTI IGKAVEHMFEGEHGSKDEWRGMVUVQA 

pimkawfyityekdpvlymyqllddykegdlrimpessbsppte 
repggwdgligkhveytkedgskrigmvihqveakpsvyfikp 

DDDFHIYVYDIiVKKS 


6506 


1 


1350 


evspptsccltvavadpgvsegfrgfgagcempgrgrcpdcgst 

ELVEDSHYSQSQLVCSDCGCWTEGVIiTTTFSDEGKTLREVTYSR 

stgeneqvsrsqqrglrrvrdlcrvlqlpptfbdtavavyqqay 

RHSG IRAARLQKKEVLVGCCVLITCRQHNWPLTMGAI CTIiLYAD 

LDVPSSTYMQIVKLLGLDVPSLCLAELVKTYCSSFKI^FQASPSV 

PAKYVEDKEKMLSRTMQLVELANETWLVTGRHPLPVITAATPliA 

WQSLgPADRLSCSLARFCKLANVDLPYPASSRLQELLAVLLRMA 

EQLAWLRVLRLDKRSWKHIGDLLQHRQSLVRSAFRDGrABVET 

REKEPPGWGQGQGEGBVGNNSLGIiPQGKRPASPALLLPPCMLKS 

PKRICPVPPVSTVTGDENISDSEIEQYLRTPQEVRDFQRAQAAR 
QAATSVPKPP 


6507 


1878 


929 


RSHASRLPEI.PSGCLVLQVQELVQ^aSGMEATVTI P I WQNKPHGA 
ARSWRRIGTNLPLKPCARASFETLPNISDI^CLRDVPPVPTLAD 
lAWIAADEEETYARVRSDTRPLRHTWKPSPLIVMQRKASVPNLR 
GSEERLLALKKPALPATiSRTTEIjCSDELSHLRSQIAKIVAADAAS 
AS LTPDFL S PGSSNVSS PL P C FGSS FHS TTS PVI S D T TRTTTm n? 

VPELPSVPIiLCSASPECCKPEHKAACSSSEEDDCVSliSKASSFA 

DMMGILKDFHRtyiKQSQDLNRSLLKEEDPAVLISEVLRRKFALKE 
EDISRKGM 


6508 


862 


342 


wlfiAKKKPQRWPSERREVRVPPPHLQRGRSGLEPGTFRKMAAARP 
SliGRVLPGSSVLFLCDMQEKFRHNIAYFPQIVSVAARMLKNTTL 

dlldrglqvhvwdacssrsqvdrlvalarmrqsgaflstsegl 

ILQLVGDAVHPQFKBIQKLIKEPAPDSGLLGLFCXSQNSLLH 


6509 " 


2 


1053 


fvivnprggrkrrrqaavtqaatrasgtpsprdgtmtqgklsvan 
kapgtegqqqvhgekkeapavpsappsyeeatsgegmkagafpp 
fvptavplhpsmayvdpsssssydwgfptgdhelpttpswddqkv 
elrvfvrkvytilliqllvtlawalftpcdpvkdyvqatlpgwyw 
\syavffatyltlaccsqprrhfpwnlilltvftlskayltgml 
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SEQ 
ID 
NO: 


"predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


j^uixuu aoiu segment concaaning signal peptide 
(A=Alanine, CaCysteine, D^Aspartic Acid, E= 
Glutamxc Acid, P=Phenylalanine, G=Glycine, 
H=Hifitidine, I=Isoleucine, K=Lysine, 
Iiabeucine, M=Methionine, N-Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, y=Tyrosine, x^UnkAown, *«stop 
Codon, /=possible nucleotide deletion, 
_ \ ^vop.LAjj.c nti.c-.eocacie insertion) 


6510 






SSYYNTT5Vlibt:LGITALVCLSVTVFSFQTKFDFTSCQGVLFVL 
liMTLFFSGLIIAILLPFQWPWLHAVYAALGAGVFTLPLALDTQ 
LLMGNRRHSLSPEEYIFGALNIYLDIIYIFTPFLQLFGTNRE 


6511 


37 


1156 


PCALDGCPUKUAVHPLLSSAMGLIAPLKTQFVLHLLVGFVFWS 
GbVINFVQLCTLALWPVSKQLYRRLNCRlAYSLWSQLVMLLEWW 
SCTECTLFTDQATVERFGKBHAVIILNHNFEIDFLCGWTMCERF 
GVLGSSKVIAKfCELLyVPLlGWTWYFLEIVFCKRKWEEDRDTW 
EGLRRLSDYPEYMWFLLYCEGTRFTETKHRVSMEVAAAKGLPVL 
KYHLLPRTKGFTTAVKCLRGTVAAVYDVTLNFRGNKNPSLLGIL 
YGKKYEADMCVRRFPLEDIPLDEKEAAQWLHKLYQEKDALQEIY 
NQKGMFPGEQFKPARRPWTLLNFLSWATILLSPLFSFVLGVFAS 
GS PLLILTFLGFVGAGNGHCR 


6512 


2^41 


1425 


GEEQPLAAAi'TBCljEQVIGGAGDPGTWASFPSPLPGPAPLKGGK 
TMATNFSDIVKQGYVKMKSRKLGXYRRCWLVFRKSSSKGPQRLS 
1 KYPDEKSVCLRGCPKVTEISNVKCVTRLPKETKRQAVAIIPTDD 
SARTFTCDSELEAEEWYKTLSVECLGSRLNDISLGEPDLLAPGV 
QCEQTDRFNVPLLPCPNLDVYGECKLQITHENIYLWDIHNPRVK 
LVSWPLCSLKJRYGRDATRFTFEAGRMCDAGEGLYTFQTQEGBQI 
YQRVHSATLAIAEQEKRVLLEMEKNVRLLNKGTEHYSYPCTPTT 
MLPRSAYWHHtTGSQNIAEASSYAGEGYOAAQASSETDLLNRPI 
LLKPKPSQGDSSEAKTPSQ 


S513 


159 


807 


FGKKSTWFtl^RSUiVASGRSCKLGHGGYTGSGPGFGEPRDSGA 
EVPSGSGRATGCBRC-GVRGARQGRAPGSSIWRKEPRMVCTRKTK 
TLVSTC VILSGMTNI I CLLYVGWVTNYI ASVYVRGQBPAPDiCKL 
HiiDKGDTLKIIERUJHLENVIKQHIQEAPAKPEEAEAEPFTDSS 
LPAHWGQELSPEGRRVALKQFQYYGYNAYLSDRLPLDRP 


6514 


2 


756 


FVSPEPGFSLAgu^IWQLTDTKQLVHSFAEGQDQGaAYANRTA 
LFPDLLAQGNASUiLQRVRVADEGSFrCFVSIRDFGSAAVSLQV 
AAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVFWQDGQ 
GVPLTGNVTTSQMANEQGLFDVHSII^RWLGANGTYSCLVRNPV 
liQQDAHS SVTITPQRSPTGAVEVQVPEDPWALVGTDATLRCS P 
SPEPGFSLAQLNLIWQLTDTKQLVHSFAEGQDQGSAYANRTALP 
PDLLAQGNASLRLQRVRVADBGSPTCFVSIRDFGSAAVSLQVAA 
PYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVPWQDGQGV 
PLTGNVTTSQMANEQGLFDVHSILRWLQANGTYSCLVRNPVLQ 
QDAHSSVTITPQRSPTGAVEVQVPBDPWALVGTDATLRCSPSP 
EPGFSLAQIiNIjIWQLTDTRQLVHSFTEGR 


6515 


985 


302 


VeiPGPTISSAAEMEDLLDlOEEIJiYSIATSRAKMGRRAQQESA 

UM/UVHliNGKUS SLTLTGETSSAKLPRCRQGGWAGDS VKASKFRR 

KASEEIEDPRLRPQSLNGSDYGGDIPIIPDLEEVQEEDFVLQVA 

APPSIQIXRVMTYRDLDNDLMKYSAIQTLDGEIDLKI.LTKVLAP 

EHEVRERNPSWQDDVGWDWDHtiFTBVSSSVLTEWDPIjQTEKEDP 
AGQARHT 


6516 


1345 


305 


GRVGSRRRSAAVPGGCGAGSTULEVSASASCGALGSAbMNPIVV" 

VHGGGAGPI SKDRKERVHQGMVRAAT VG YGILREGGSAVDAVEG 

AWALEDDPEFNAGCGSVLNTNGEVEMDASIMDGKDLSAGAVSA 

VQCIANPIKLARLVMEKTPHCPLTDQGAAQFAAAMGVPEIPQEK 

LVTERNKKRLEKEKHEKGAQKTDCQKNLGTVGAVALDCKGNVAT 

ATSTGOIVWKMVGRVGDSPCLGAGGYADNDIGAVSTTGHGESIL 

[CVNLARLTI,FHIEQG:<TVEEAADLSLGYMKSRVKGriGGt.IWSK 

rGDWVAKWTSTSMPWAAAKDGKLHFGIDPDDTTITDLP 




1 


14 02 

] 
1 
1 

] 


t^RHbRYLGQDATAAARDLRTRGLQGYCPSATAR^iQVLVSALQQL 
CGRRSEHRNENQEMPYSTNKELILGIMVGTAGISLLLLWYHKVR 
CPGlAMKLPBFLSLGNTFNSITLQDEiriDDQGTTVIFQBRQLQX 
^KLNELLTNMEELKEBIRFLKEAIPKLEEYIQDELOGKITVHK 
CSPQHRARKRRLPTIQSSATSNSSEEAESBGGYITAWTDTEEQS 
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SEQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


freaicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


«riino acid segment containing signal -peptide 
<A«Alanine, C=Cysteine, D=Aspartic Acidf B- 
Glutamic Acid, F=Phenylalanine. G:=Glycine. 
H-Histidine, l=Isoleucine, K^^Lysine 
L=Leucine, M=Methioiilne, N^Asparagine 
P=Prollne, Q=Glutamine, R^Arginine, 
S=Serlne, T=Threonine, V^Valine, 
W=.Tryptophan, Y=Tyrosine, X»Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\-possible nucleotide insertion) 




S517 






r^v^KAFNTRVbKLNUlVLLQKVUHLRMSKSGK^iKayELLRDHK 
EKFRDEIEFMWRFARAYGDMYSLSTNTQEKKHYAUIGKTLSERA 
INRAPMNGHCHLWYAVLCGYVSEPEGLQNKINYGHLFKEHLDIA 

ALLLPTVTKEDKBAQKEMQKXMTSLKR ^rt^i. 




6518 


3 


1414 


^KVWUGSSSLNAMVYVRGHAEDYERWQRQGARGWbYAHCLPyPR 
KAQGHELGASRYRGADGPLRVSRGKTNHPLHCAFLEATOQAGYP 
LTEDMNGFQQEGFGWMDMTIHEGKRWSAACAYLHPALSRTNLKA 
EAETLVSRVLFEGTRAVGVEYVKNGQSHRAYASKEVILSGGAIW 

o^I^xi^^^^^^^^^^^'^^^^E^WKFTOEGATAHLETGGF^ 
t^o''''^^^''^^^^^^^«^^^^QEAYQVHVGPMSTS^ 
GWLKLRSANPQDHPVIQPNYLSTETDIEDFRLCVKI^TREIFAQE 
^«!^f^^^°^^^^Q^°^^^A™^AKADSAYHPSCTCKMGQ 
PSDPTAWDPQTRVLGVENLRWDASIMPSMVSGNLWAPTIMIA 
EKAADIIKGQPALWDKDVPVYKPRTLATOR ^"^PTIMIA 




6519 


242 


1098 


nS^^^^^^^^^^^^^^'^'^^P^^^P^RRJ^HRLJ^RAVPGPSR 

TPTLSPSQQPLPTELNVTSPSKBECGPCTDTAHVSLITPTKRSC 

GTDSQSBNEASPVKRPRLLENTERSBETSRSKQKSRRRCFQCQT 

^^!^S?^^^^^^^^^"^^P^QHDCTPDHMGRGREEAIM 
KMVKLDRKVGRSCQRIGEGCS v'^c-iia-lm 




6520 


3 
3 


r 1113 


r^Ki^FPSPVHCl^AAAAPTAlVSEKEPFGKLQLSSRDPPGSLS 
VMNE^KKENGEVKILI,KSGKEKPKTN1EDLQIKKVKKKKKKKHK 


6521 




1113 


li^Ki^^PPSPVHCWVAAAPl'A'IVSEKEPFGKbCjLSSRDPPGSLS- 
AKKVRTEEKKAPRRVNGBGGSGGNSRQI^PPA^PSPQ^^^^ 

WSPAPLSAAPSPSSSRSSPSFSAGTAVPSSASASLSQPGPRKLL 

GGARBAGGASREENGEVKPLPRDKIKDKIECERDKEKEREKKKHiC 
VMNEIKJCENGEVKXLLKSGKEKPKTNIEDLQIKKVKKKKia^K 


6522 


184 
1042 


1798 

: 

I 

I 
I 

» 

L 
F 
K 

391 " N 


KLFKKATDTiiUUELVHPKALPLIVGAQLIHADKLGEKVSDSTMP"- 

IRSTVNSTRETPPKSKIAEGEEBKPBPDISSEESVSTVEEQENE 

rPPATSSEAEQPKGEPENEEKEENKSSEETKKDEKDQSKEKEKK 

/KKTIPSWATLSASQIJU^AQKQTPMASSPRProroAILTEAIKAr 

?QKSGASWAIRKYIIHKYPSLELERRGYLLKQALKRELNRGVI 

.QRAVERGQLEQITGKGASGTFQLKKSGEKPLLGGSLMEYAILS 

aAAMNBPKTCSTTALKKYVLBNHPGTNSNYQMHLLKKTLQKCB 

TCWMEQlSGKGFSGTFQLCFPyypsPGVLPPKKEPDDSRDEDE 

EDESSBEDSEDEBPPPJCRRLQKKTPAKSPGKAASVKQRGSKPA 

KVSAAORGKARPLPKKAPPKAKTPAKKTRPSSTVIKKPSGGSS 
KPATSARKE 

i^wx.KPSPRSHRTPJ:iSGRVLSLFRLPPPGMALSGSTPAPCWEED 
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SEQ 
ID 


Predicted ' 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cyeteina, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G==Qlycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
Ii^^Leucine, M=Methxonine, N^^Asparagine, 
P= Proline, Q=Glutamine, R=Arginine, 
S=Serine, T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








ECLDYYGMliSJjHRMFEWGGQLTECELELLAFIiLDEAPGAAGGL 
SRARSGLKbLLELERRGQCDESNLRLLGQIiIiRVLARHDLLPHLA 
RKRRRPVSPERYSYGTSSSSKRTEGSCRRRRQSSSSANSQQGSP 
PTKRQRRSRGRPSGGARRRRRGPQPHPSSSQSPPDLPLKAK 


6523 


2 


1097 


ASCQTRRRTAAIiDSGERIAGRRSPIAIiAMASNPNDIVKQGYVKl " 

RSRKLGIFRRCWLVFKKASSKGPRRLBKFPDEKAAYPRNFHICVT 

ELHNI KNI TRLPRETKKHAVAI I FHDETSKTFACESELEAEEWC 

KHLCMECLGrai^ISLGEPDLlAAGVQREQNERFimrLMPTPN 

LDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDST 

WFTFESGRMCDTGEGLFTFQTREGEMIYOKVHSATIiAIA.PnHT?P 

LMLEMEQKARLQTSLTEPMTLSKSISLPRSAYWHHITRQNSVGE 

lYSLQGNHENRHSDLTGKSCKTSENRFLEEMAPLVMYGITHHLF 
MDTSTCKWHDLE 


6524 


2 


1057 


AS CQTRRRTAALDSGBRIAGRRS P lALAMASWFNDI VKQG YVKI " 

RSRKLGIPRRCWLVFKKASSKGPRRIiEKFPDEKAAYFRNFHKVT 

ELHNIKNITRIiPRETKKHAVAIIFHDETSKTFACESELEAEEWC 

laiLCMECLGTRLNDISLGEFDLLAAGVQREQNERPlinriirLMPTPN 

LDIYGECTMQITHENIYLWDIHNAKVKLVMWPLSSLRRYGRDST 

WFTPESGRMCDTGEGLFTFQTRE6EMIYQKVHSATLAIAEQHER 

LMLEMEQKARLQTSLTEPMTI*SKSISLPRSAYWHHITRQNSVGE 

lYSLQGNHENRHSDLTGKSCKTSENRFLBBNAPLVMYGlTHHLF 

MDTSTCKWHDLE 


6525 


1 


1859 


GESPFSEEESIEFNPSSSGRSARTVSSNSFCSDDTGWPSSQSVS " 

PVKTPSDAGNSPIGFCPGSDEGFTRKKCTIGMVGEGSIQSSRYK 

KES KSGLVKPGS EADFSSSSS TGS I SAPEVHMSTAGSKRSS SSR 

NRGPHGRSNGASSHKPGSSPSSPREKDLLSMLCRNQLSPVNIHP 

SYAPSSPSSSNSGSYKGSDCSPIMRRSGRYMSCGENHGVRPPNP 

EQYLTPLQQKEVTVRHLKTKLKESERRIiHERESBIVELKSQrAR 

MREDWIEEECHRVEAQLALKEARKEIKQLKQVIETMRSSLADKD 

KGIQKYFVDlNIQNKKLESLLQSMEMAHSGSLRDErjCLDFPCDS 

PEKSLTLNPPLDTMADGIjSLEEQVTGEGADRELLVGDSIANSTD 

LFDEIVTATTTESGDLELVHSTPGANVLELLPIVMGQEEGSVW 

eravqtdwpyspaiseliqsvlqklqdpcpsslaspdesepds 

MESFPESLSALWDLTPRKfPNSAILLSPVETPYANVDAEVHANR 

lmreldpaacvesrldgviplarggwrqywsssflvdllavaa 

PWPTVLWAFSTQRGGTDPVYNIGALUZGCCWALHSLRRTAFJR 
IKT 


6526 


2 


2034 


sgragepeewrgrqiidsketwipfnsedsqqleeayssgkgcn 
grwptdggrydvhlgermryavywdelasbvrrctwfykgdkd 

NKYVP YSESFS Q VLSETYMIAVTLDEWKKKLES PNRE II ILHNP 

klmvhyqpvagsddwgstpmeqgrprtvkrgvenisvdihcgep 

LQIDHLVFWHG IGPACDLRFRS I VQCVNDFRS VSLNLLQTHFK 

kaqenqqigrvbflpvnwhsplhstgvdvdlqritlpsinrlrh 

FTNDTILDVPFYUSPTYCQTIVDTVASBMNRIYTLFLQRNPDFK 
GGVS IAGHSLGSLILFDIX.TNQKDSLGD IDSEKGSLNI VMDQGD 
TPTLEEDLKKriQLSEFFDIFEKEKVDKEAIiALCTDRDLQEIGIP 
LGPRKKILNYFSTRKNSMG r KRPAPQPASGANI PKESEFCSSSN 
TRNGDYLDVGI6QVSVKYPRLIYKPEIFFAFGSPXGMFLTVRGL 
KRIDPNYRFPTCKGFFNIYHPFDPVAYRIEPMWPGVEFEPMLI 
PHHKGRKRMHLELREGLTRMSMDLKIJNLIiGSrjRMAWKSFTRAPY 
PALQASETPEETEABPES TSEKPSDVNTEETSVAVKEE VLP I NV 
GMLNGGQRIDYVIiQBKPIESFUEYLFALQSHLCYWESEDTVLLV 
LKEIYQTQGIFLDQPLQ 


6527 


1 


922 


GWVPLLSRILPSDACKIYKQGINIRLDTTtilDFTDMKCQRODLS " 

FIFNGDAAPSESFWLDNEQKVYQRIHHEBSEMBTEEEVDII^MS 

SDIYSATLSTKSISFTRAQTGWIiFREDKTERVGNFIiADFYLVNfG 
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SEQ 
ID 
NO: 


Predicted 

Karji rtrt i ncr 

nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end" 
nucleotide 
location 
cor re spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C^Cysteine, D==Aspartic Acid, E« 
Glutamic Acid, P= Phenyl alanine, G=Glycine, 
K^Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P= Proline, Q=Glut amine, R=Arginine, 
S==Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyro3iiie, X=anknown, **Stop 
Codon, /^possible nucleotide deletion, 
X^'possible nucleotide insertion) 








LVLBSRKRREHLSBEDILRNKAIMESLSKGGNIMEQNPEPXRRQ 
SLTPPPQNTITW3EYISAENGICAPHLGRELVCKESKKTFKATIA 
MSQEFPI/SIELLLNVIiBWAPFKHPNKIiREFVQMKLPPGFPVKL 
DIPVFPTITATVTFQEFRYDSFDGSIFTIPDDYKEDPSRFPDL 


G528 


1 


1073 


LTGPAAAEPRCAADAGMKliALGRRKGVWLRLRKI LFCVLG'CyiA~^ 

IPFLIKLCPGIQAKLIPLNFVRVPYFIDLKKPQDQGLNHTCNYY 

LQPEEDVTIGVWHTVPAVWWKNAQGKDQMWYEDAIiASSHPIILY 

LHGNAOTRGGDHRVELYKVLSSLGYHWTFDYRGWGDSVGTPSE | 

RGMTYDALHVPDWIKARSGDNPVYIWGHSLGTGVAmLVRRLCE 

RETPPDALILBSPFTNIREEAKSHPFSVIYRYFPGFDWFFLDPI 

TSSGIKFANDENVKHISCPLIjIIiHAEDPPWPFQLGRKLYSIAA 

PARSFRDPKVQFVPFHSDLGYRHKYIYKSPELPRILREFLGKSE 
PEHQH 


6529 


363 


2215 


XHIRYWXIGWKTMSCGNEFVETLKKIGYPKADNLNGEDPDWLF 

EGVEDESFLKWFGGNVNEQNVLSERELEAFSiriQKSGKPILEGA 

ALDEAJjKrCKTSDLICrPRLDDKEXEKLEDEVQTLLKLKNLKIQR 

RNKCQI,MASVTSHKSIiRLNAKEEEATKKLKQSQGII,NAMITKIS 

NELQALTDEVTQLMMPFRHSNLGQGTKPLVFLSQFSLEKYLSQS 

EQSTAALrLYTKKQFFQGIHEWESSNESQPPNFIjKIQTPS 1 CD 

NQE2LEERRLEMARU3LAYICAQHQLIHLKASNSSMKSSIKWAE 

BSLHSLTSKAVDKENLDAKISSLTSEIMKLEKEVTQIKDRSLPA 

WRENAQLLKMPWKGDFDLQIAKQDYYTARQELVLNQLIKQKA 

SFELLQLSYEIELRKHRDIYRQLENLVQELSQSNMMLYXQLEML 

rDPSVSQQlNPRNTIDTKDysrHRLYQVLBGENKKKEt»FLTHGN 

LBEVAEKLKQNISLVQDQLAVSAQEHSFFIiSKRNKDVDMLCDTL 

YQGGNQLLLSDQELTEQPHKVESQLNKLNHULTDILADVKTKRK 

TLANNKLHQMEREFYVYFLKDEDYLKDIVENLETQSKIKAVSLE 
D 


6530 


128 


2986 


GAAHHGAIVQVHPLLPGSSTIMIHDLCLVFPAPAKAVVYVSDIQ 

ELYIRWDKVEIGKTVKAYVRVLDLHKKPPLAKYFPFMDLKLRA 

ASPIITLV7VLDEALDNYTITPLIRGVAIGQTSLTASVTNKAGQR 

INSAPQQIEVFPPFRLMPRKVrLLIGATMQVTSBGGPQPQSNII, 

FSISNESVALVSAAGLVQGIAIGNGTVSGLVQAVDAETGKWII 

SQDIiVQVBVLLLRAVRIRAP IMRMRTCTQMPI YVTG ITNHQNPP 

SFGNAVPGLTFHWSVTKRDVLDLRGRHHEASIRnPSQYNFAMNV 

LGRVKGRTGLRAWKAVDPTSGQLYGLARELSDEIQVQVFEKLQ 

LLNPBIEAEQILMSPNSYIKLQTNRDGAASuSYRVLDGPEKVPV 

VHVDEKOFLASGSMIGTSTIEVIAQEPFGANQTIIVAVKVSPVS 

YLRVSMSPVLHTQNKSALVAVPLGMTVTFTVHFKDNSGDVFHAH 

SS\^FATNRDDFVQIGKGPTNWrCVVRTVSVGLTLLRVWDAKH 

PGLSDFMPLPVLQAISPELSGAMWGDVLCLATVLTSLEGLSGT 

WSSSANSILHIDPKTGVAVARAVGSVTVYYEVAGHIiRTYKEVW 

SVPQRIMARHLH PIQTSFQEATAS KVI VAVGDRSSNIiRGECTPT 

QREVIQAIiHPETLISCQSQFKPAVFDPPSQDVFTVEPQFDTALG 

QYFCSITMHRLTDKQRKHLSMKKTALWSASLSSSHPSTEQVGA 

EVPFSPGLFADQAEILLSNHYTSSEIRVFGAPEVLBNLEVKSGS 

PAVIAFAKBKS FGWPSPITYTVGVLDPAAGSQGPLSTTLTPS SP 

VTNQAIAIPVTVAFWDRRGPGPYGASLFQHFLDSYQVMFFTLF 

ALLAGTAVMI lA YHTVCrPRDLAVPAALTPRASPGHS PHYFAAS 

SPTSPWALPPARKASPPSGLWSPAYASH 


6531 
6532 


845 

2 


1425 

. 

954 


PSASIPPSASPDPVPDIRTCHFCLVEDPSVGCISGSEKCTISSS" 
SLCMVITIYYDVKVRFJVRGCGQYISYRCQBKRNTYFAEYWYQA 
QCOQYDYCMSWSSPQLQSSLPEPHDRPLALPLSDSQIQWFYQAI, 
eTLSLPLPKrFHAGTEPDGLDPMVTLSLNLGLSFAELRRMYLFLNS 
SGLLVLPOAGLLTPHPS ( 

aAGPPSEVVNQLJSLFPEPEPGPAPQVLLGPQGPGLIKGVAPrari 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid ' 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptfSe 
(A^lanine, C=Cysteine, D=.A9partic Acid, E« 
Glutamic Acid, P= Phenylalanine, G=Glycine^ 
H=Histidine, I^Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, Na^paragiiie , 
P=Proline, Q=^lutamine, R^rArginine, 
S==Serine, T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *«Stop 
Codon, /=possible nucleotide deletion, 
\«possible nucleotide insertion) 








ITDSTGTHLVLTVTNKNAHSPGLSRGSPCK3PSSQPGSPAPAPSA 
QMDLEHPLQPLFGrPTSLLKKEPPGYEEflMSQQPKQQENGSSSQ 
QMDDLFDILIQSGEISADFKEPPSLPGKEKPSPKTVCWSPLAAQ 
PSPSABLPQAAPPPPGSPSLPGRLEDFLESSTGLPLLTSGHDGP 
EPLSLrDDLHSQMLSSTAIIiDHPPSPMDTSELHPVPEPSSTMGL 

DLADGHLDSMDNLELSSGCPVLSIAPLSTTAPSLFSTDFIiDGHD 
LQLHWDSCL 


€533 


1798 


373 


STISWLARVEPPRRSSGVGAARLRFPGGSRPIiRARACVLALAVL * 
ALLERWNADSMSAHSMriCERrAIAKEIiIKRAESLSRSRKGGIEG 
GAKLCSKLKAELKFLQKVEAGtCVAIKESHLQSTNLTHLRAIVES 
AENLEE WS VLH VFGy TDTLGEKOTLWDWANnflHTfcnrti- a rn-o 

KABALHWIWLGRGQYGDKSIIEQAEJDPIiQASHQQPVCysWPHri 
FAFYNSVSSPMAEKLKEMGISVRGDIVAVNALLDHPEELQPSES 
ESDDEGPELLQVTRVDRBKI tiASVAFPTBIKVDVCKRVNLDI TT 
LITYVSALSYGGCHFIFKEKVLTEQAEQERKEQVLPQLEAFMKD 
KELFACESAVKDFOSILDTLGGPGERERATVLIKRINWPDQPS 
ERALEUjVASSKINSRSLTI PGTGDTLKAITMTAN<ir!??\7P a awwo 

GVKFSVFIHQPRALTESKEALATPIjPKDYTTDSEH 


6534 
■ 6535 


47 


596 


KATRFISAAFWLNKQGVSPAKLPHTSWSWSLQTLSPriPSGDLA" 

EKSLQCFPCSAMLLELIPLLGrHFVLRTARAQSVTQPDIHITVS 

EGASUELRCNYSYGATPYLPWMERTVEBAFIIiLVCLKPWRVASS 

LEKKEKEDESFQLLLGSRYlTVLKAHCLIiPLIRWLTSGDSLLSAO 
PHCPQGL 




250 


964 


LIKTFFRDVAIQRDIjLPKBKNLETLLTliAFbEIDKAt^SSHARLS^ 
ADATLLTSGTTATVALLRDGIELWASVGDSRAILCRKGKPMKL 
TIDHTPERKDEKERI KKCGGPVAWNSLGQPHVNGRLAMTRS IGD 
IiDLKTSGVI AEPETKRIKLHHADDSFLX^TTDGINPM VNSQBI W 
DFVNQCHDPNEAAHAVTEQAIQYGTEDNSTAVWPFGAWaKYKN 
SEINFSFSRSPASSGRWA 


653€ 


242 


1174 


SLVKEMTUQYGILFKQEQAHD0AIWSVAWGTNKKENSETVVTGS 
LDDLVKVWKWRDERLDLQWSLEGHQLGWSVDISHTLPIAASSS 
XjDAHIRLWDLENGKQIKSIDAGPVDAWTLAFSPDSQYLATGTHV 
GKVNIFGVESGKKEYSLDTRGKFILSIAYSPDGKYLASGAIDGI 

inipdiatgkllhtleghampirsltfspdsqllvtasddgyik 
lydvqhanlagtlsghaswvlnvapcpddthfvssssdksvkvw 
dvgtrtcvhtffdhqdqvwgvkyngngskivsvgddoeihiydc 

PI 


6537 


1638 


921 


wrfkppptqgpdpslvyrpdvdpevakdkasfrnytsgplldrv 
pttyklmhthqtvdfvrskhaqfggfsykkmtvmeavdlldglv 
desdpdvdfpnsphafqtaegirkahpdkdwfhlvgllhdlgkv 
ialpgepqwawgdtfpvgcrpqaswfcdstfqdnpdlqdpry 
stelgmyqphcgldrvlmswghexsearggqwgggqrwgtvggog 

AEAVPAGDTIaS PQSTCTR 


6538 
6539 


3345 
218 


2412 
339 


P YLYDFLDAL ITCQTAPEEAFI KLDGLAGMLTSQLRRLTKQVQE 

arhnrddeaikkavneydetmekyxpvlmaoakiywnlenypmv 
ekifrksvefcndhdvwklnvahvlfmqenkykeaigfyepivk 
khydwrlnvsaivlawlcvsyimtsqnekaeblmrkiekeeeql. 

SYDDPNRKMYHLCIVNLVIGTLYCAKGNYEFGXSRVIKSIiEPYN 
KKLGTDTWyYAKRCFIjSI*LENMSKHMIVIHDSVIQECVQPLGHC 
ELYGTNI PAVI EQPLEEERMHVGICNTVTDESRQLKAL I YE I IGW 
NK 

flgaasphphfsslaphpdqpeftpvqdeleamelwgpgv 1 


6540 


3 


391 


lerlwllliirrpedamaecptlgeavtdhpdrlwawekfvylde 
kqhawlpltieikdrlqlrvllrredwlgrpmtptqigpsllp 
rmwqlypdgryrssdssfwrlvyhikidgvbtmiilellpdd 
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SEQ 
ID 
NO: 


Predicted 
beginning 
THicleohi f^*i 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

J. UC-d t. XUXl 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=>Alanine, C=Cyateine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H=iHistidine* I=Isoleucine, K=Ly*sine, 
Ii=Leucine, M^Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R-Arginine, 
S^Serine, T=Th.reonine, V=Valine, 
WttTryptophan, Tf=Tyrosine, X=Unknown, *=Stop 
Codon, /^.possible nucleotide deletion, 
\=spoQgible nucleotide insertion) 


£541 


1165 


536 


RTLVQRRIIiMtjLRKPARGRDLRGRGRGTPRGGRKGLLPTPDEFP 
R FEGGRKPDS WDGNREPGPGHEHFRDTPR PDHPPHDGHS PAS RE 
RSSSLQGMDMASXiPPRKRPWHDGPGTSEHREMEAPGGPSEDRGG 
KGRGGPGPAQRVPKSGRSSSXjDGEHHDGYHRDEPFGGPPGSGTP 
SRGGRSGSNWGRGSNMNSG PPRRGASRGGGRGR 


6^42 


3 


3775 


SWPRGRGETGGHPGALRTRTMQKSVRYNBGHALYIiAFIiARKEGT 
KRGFIiS KKTAEASRWHBKWPALYQNVLFYFEGEQS CRPAGMYLL 
EGCSCERTPAPPRAGAGQGGVRDALDKQYYFTVLFGHEGQKPLE 
XjRCEEEQDGKEWMBAIHQASYADILIEREVLMQKYIHLVQIVET 

ekiaanqlrhqledqdteierlkseiialnktkbrmrpyqsnqe 

DEDPDIKK I KKVQSFMRGWLCRRKWKTI VQD YI CS PHABSMRKR 
NQIVPTMVEAESEYVHQLYILVKGFLRPLRMAASSKKPPISHDD 
VSS IPLNSETXMFIiHE I FHQGLKARI ANWPTLILADLFDILLPM 
LNIYQBFVRNHQYSIKSVIANCKQNRDFDKIiLKQYEANPACEGRM 
IiETFLTYPMFQI PRYI iTLHEIiLAHTPHEHVBRKSLBFAKSKIiE 
ELSRVMHDEVSDTBNIRKKfLAIERMI VEGCD I LLDTSQTFIRQG 
S LI QVPSVERGKLS KVRLGSLSLKKEGERQCPLFTKHFLI CTRS 
SGGKLHLLKTGGVLSLI DCTLIEE PDASDDDS KGSGQVFGHXJ3F 
KIWEPPDRAAFTWLLAPSRQEKAAWMSDISQCVDNIRCNGLM 
TIVFEENS KVTVPHMI KSDARLHKDDTDICFS KTUNSCKVPQIR 
Y7VSVERLLBRl.TDLRFLSIDPLNTFLHTYRIFTTAAWlrGKLSD 
lYKRPFTSIPVRSLBLPFATSQNNRGEHLVDGKSPRrjCRKFSSP 
PPLAVSRTSSPVRARKLSLTSPLiNSKIGALDLTTSSSPTrTTQS 
PAASPPPHTGQI PLDLSRGLSSPEQS PGTVEENVDNPRVDLCNK 
LKRSIQKAVLESAPADRAGVESSPAADTTELSPCRSPSTPRHLR 
YRQPGGQTADNAHCSVSPASAFAIATAAAGHGSPPGFNNTERTC 
DKEF I IRRTATKRVLNVLRHWVSKHAQDFELNNELKMNVIiNLLE 
EVLRDPDLLPQERKAAANIIMALSQDDQDDIHIiKIiEDIIQMTDC 
MKAECFESLSAMEJLAEQITIjIiDHVIFRSIPYEEFI*GQGWMKIjDK 
NERTPYIMKTSQHPNDMSNLVASQIMNYADVSSRANAIEKWVAV 
ADI CRCLHNYNGVLE ITSALNRSAI YRLKKTWAKVSKQTKAIjMD 
KLQKTVSSEGRFKNIjRBTLKNCWPPAVPYIiGMYLTDLAPIEEGT 
PNFTEEGIiVNFSKMRMISH IIREIRQPQQTS YRI DHQPKVAQYL 


6543 


1857 


950 


PVS GCGRAG IGLSWAMAAEARVSRWYFGGLT^CGAACCTHPIiDL 
LKVHI^TQQEVKLRMTGMAIjRVVRTDGIIiALYSGLSASIiCRQiyiT 

TPADLVNVRMQNDVKLPQGQRRNYAHALDGLYRVAREEGLRRLF 
SGATMASSRGALVTVGQIiSCYDQAKQLVLSTGYLSDNIFTHFVA 
SFIAGGCATFLCQPLDVLKTRLMNSKGEYQGVFHCAVETAKLGP 
LAFYKGLVPAGIRLIPHTVLTFVPLEQIiRKNFGIKVPS 


6544 


630 


79 


PSPCFI RSRIiDGQPWMAGLEAWLSQWFSIiHQPQSRVRVRRAS I S 
BPSDTDPEPRTLNPSPAGWPVQQHPELBLMSSFRERFGRNWLQY 
RSHLEP S GNPIiPATPTTSAPSAP PASSQGPDTAPRPSPPQEEAR 
GPQESPQKMSEEVRAEPQEEEEEKBGKEEKEEGEMAPLPEAHLG , 
EGKQKECP 


6545 


176 


560 


PPHSHAALLPAAMTPLIiTLILWLMGLPLAQALDCHVCAYWGDN 
CFNPMRCPAMVAYCMTTRTYYTPTRMKVSKSCVPRCFETVYDGY 
SKHASTTSCCQYDLCNGTGLATPATIiALAPILLATLWGLL 


6546 


1657 


364 


HLIiNGLDEVAAFFVADIiGAIVRKHFCFLKCLPRVRPFYAVKCNS 
SPGVLKVLAQLGLGFSCANKAEMELVQHIGIPASKIICAKTPCKQ 
lAQIKYAAKHGIQLLSFDNEMELAKWKSHPSAKMVLCIATDDS 
HSLSCLSIiKFGVSLKSCRHLLENAKKHHVEWGVSFHIGSGCPB 
PQAYAQSIADARLVFEMGTELGHKMHVLDLGGGFPGTEGAKVRF 
EElASVIISrSALDLYFPEGCGVDIFAEUSRYYVTSAPTVAVSIIA 
KKE VLLDQPGREEENGSTSKTi VYHLDEGV YG I FNS VLPDNI CP 
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SEQ 
ID 
WO: 


~i Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containina sianaJ r^f^r^t-A^^ 
lA=Alanine, C^Cysteine, D^Aspartic Acid, e= 
Glutamic Acid, P^Phenylalanine, G=Glycine 
H«Histidine, I=Isoleucine, K=Lysine, 
L^Leucine, M=Methionine, WsAsparagine , 
PsProline, Q=Glut amine, R=Arginine, 
S^Serine, T=ThreoninQ, V-Valine, 
"^Tryptophan, Y«Tyrosine, X^Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\-possible nucleotide insertion) 


6547 






iPlLQKKPsTEQPLYSSSLWGPAVDGCDCVAEGLWLPQLHVGDW 
IiVFDNMGAYTVGMGSPFWGTQACHITYAMSRVAWEALRRQLMAA 
EQEDDVEGVCKPLSOGWEITDTLCVGPVFTPASIM 


6548 


1 


541 


lhskylapai^csqpgmmrccrrrcccrqpphalrpllLlplvll 

PPLAAAAAGPNRCDTIYQGPAECLIRLGDSMGRGGELETICRSW 

ndfhacasqvlsgcpeeaaaweslqqearqaprpnwlhtlcga 

P VHVRERGTGS EXNQBTLRATAPALPMAPAPPLLAAALAIAYLL 
RPLA 


6549 


2 


219 


FVSRLSVRDVRFPTFLGCiHfi&n&MU'pnonvc' a nwrr^ vi^»it-.-;> r^r-^^ — 
— .^..^ J.4?AJS3onva/iL;/U,i'lllJ.JUjfU3[*>AAyvPlETDAEDG 

IKGCGITFTLGKGTEVGELKILSRFQNA 


6B50 


73 


1490 


BTGRVCEDARPACGSRSRRRRKEAAPGIPTPSPSSSSPTSSRPA 
ARAFSKAPARLSRPRAREEPPDPGRRYIQEEIIQARKHKIilKMC 
SSVAAKLWFLTDRRIREDYPQKEILRALKAKCCEEELDFRAWM 
DBVVLTIEQGNLGLRINGELITAYPQ\AAA^RVPTPWVQSDSDIT 
VLRHLEKMGCRLMlSrRPQAILNCVHKPWTPQELAGHGVPLPDTFS 
YGGH2NFAKMIDEAEVLEFPMWKNTRGHRGKAVFLARDKHHLA 
Dr.SHI,IRHEAPYLFQKYVKESHGRDVRVIWGGRWGTMLRCST 
DGRMQSNCSLGGVGMMCSI^EQGKQIIAIQVSWILG^aDVCGIDIJ^ 
MKDDGSFCVCEANANVGPIAPDKACNLDVAGI lADYAASLLPSG 

RLTRRMSIjLSVV'STAS"PT^B''DT?T^t5rinc!n^'nTrT^»Tiut« »v 

V vo Artoii iCja±:'iSLGFPASTAVDKI^SASSSSVDSD 

PBSTERELLTKLPGGLFNMNQLLANEIKLLVD 


6551 


2293 


922 


i^KVSRIXSAPlJQ.'CSIEQMGLAMKHGGSYARAGGSSR^CWYYLRYPF 

LFVSLIQFLIILGLVLFMVYGNVHVSTESNLQATERRAEGLYSQ 

LLGLTASQSNLTKELNPTTRAKDAIMQMWLKARRDLDRINASFR 
QCQGDRVIYTNNQRYMAAI ILSEKOrT?mT7PfnMMW"c3 nn» T T ciurp 

NQK^TKTLEVEIAKEKTICTKDKKSVLIiNKRVAEEQLVECVKTRE 
LQHQERQLAKEQLQKVQALCLPLDKDKFEMDLR])JLWRDS 1 1 PRS 
LDNLGYNLYHPLGSBLASIRRACDHMPSLMSSKVEELARSLRAD 
lERVARENSDLQRQKLEAQQGLRASQEAKQKVEKEAQAREAKLQ 
AECSRQTQLALEEKAVLRKERDNLAKELEEKKREAEnT.PMPT &t 

RNSALDTCIKTKSQPMMPVSRPMGPVPNPQPIDPASI.EBFKRKI 
LESQRPPAGI PVAPSSG 


65S2 


157 


748 


igPPDPRNMTiAAYKEKMKELPLVSLiFCSCFriADPLNKSSYKYE 
ADTVDLNWCVISDMEVIELNKCTSGQSFEVILKPPSFDGVPEFN 
ASLPRRRDPSIiEEIQKKIiEAAEERRKyOEAELLKHLABKREHBR 
EVIQKArEEmWFIKMAKEKLAQKMESNKENRBAHLAAMI^ERLQ 
EKDKHZVEEVRKNKELKEEASR 


65S3 


157 


748 


XQPPDPRNMTIAAYKEKMKELPLVSLFCSGPLADPLNKSSYKYE 
AinVDLNWCVISDMEVIELNKCTSGQSFEVILKPPSPDGVPEFW 
ASLPRRRDPSLEEIQKKLEAAEERRKYQEAELI.KHLAEKREHER 
EVIQKAIEBNNNFI KMAKEKLAQKMESNKBNREAHLAAMI^RLQ 
EKDKHAEBVRKNKELKEEASR 




2 

■ . 


1807 

< 
I 
I 


FVWSKMAAHliSYGRVNLNVLREAVRREIiREFLDKCAGSKAIVWD " 

E YLTGPFGLIAQ YSLLKEHEVEKMFTLKGNRLPAADVKHI I FFV 

RPRLBLMDIIAENVLSEDRRGPTRDFHILFVPRRSLLCEQRLKD 

LGVLGSFIHREBYSLDLIPFDGDUuSMESEGAFKECYLEGDQTS 

LYHAAKGLMTLQALYGTIPQIPGKGECARQVANMMIRMKREFTG 

SQNSIFPVFDNLLLLDRNVDLl^rPLATQLTYEGLIDEIYGlQWS 

YVKLPPEKFAPKKQGDGGKDLPTEAKKLQLNSAEBLYAEIRDKN 

F.WGSVLSKKAKI iSAAFEERttNAKTVGEI KQFVSQLPHMQAA 

RGSLAWHTSIAELIKDVTTSEDFFDKLTVEQEFMSGIDTDKVNU 

SriEDCIAQKHSLIKVLRLVCLQSVCNSGLKQKVLDYYKREILQT 

kTGYEHILTLHNLEKAGLLKPQTGGRNNYPTrRKTLRLWMDDVNE 

2NPTDISYVYSGYAPL3VRLAQt,LSRPGWRSIEEVLRlLPGPHF 

SBRQPLPTQLQKKRQPGENRVTLIFFLGGVTFAEIAAUIFLSQL 

JDGGTEYVlATTKLMNTGrSWIEALMBKPF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal peptide 
{A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, lalsoleucine, K=:Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P= Proline, Q=Glut amine, RsArginine, 
S^Serine, T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, X=Onknovm, +«Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 


6554 


119 


1244 


FEMGSQVSVBSGALHWIVGGGFGGIAAASQLQALNVPFMLVDM 
KDSFHHNVAALRASVETGFAKKTFISYSVTFKDNPRQGLWGID 
LKNQMVLLQGGEALPFSHLILATGSTGPPPGKFNEVSSQQAAIQ 
AYEDMVRQVQRSRFIVWGGGSAGVEMAAEIKTEYPEKEVTLIH 
SQVALADKELLPSVRQBVKEILLRKGVQIiLSERVSNIiEELPLN 
BYREYIKVQTDKGTEVATKLVILCTGIKINSSAYRKAFESRIAS 
SGALRVNEHLQVEGHSNVYAIGDCADVRTPKMAYLAGLHANIAV 
ANI VNS VKQRPLQAYKPGALTFLLSMGRNDGVGQISGFYVGRLM 
VRLTKSRDLPVSTSWKTMRQSPP 


6555 


1552 


498 


ihmallrkikqvllfllivtlcviIiVkkvhkgtvpkndaddese 

TPEEIiEEEIPWICAAAGRMGATMAAINSIYSNTDANILPYWG 
LRNTLTRIRKWIEHSKLREINFKIVEFtTPMGLKGKIRPDSSRPE 
LLQPLNFVRFYLPLLIHQHEKVIYLDDDVIVQGDIQELYDTTLA 
LGHAAAFSDDCDLPSAQDINRLVGLQNTYMGYLDYRKKAIKDLG 

ispstcsfnpgvivanmtewkhqritkqlekwmqknveenlyss 

SLGGGVATSPMLIVFHGKySTINPLWHIRHLGWNPDARYSEHFL 
QEAiCLIiHWNGRHKPWDPPSVHNDLWESWPVPDPAGIFKLNHHS 


6556 


241 


1449 


ASLCKGCFFVTHVLVIILPSLQSPPTFGFLLDIDGVLVRGHRVi" " 

PAALKAFRRLVNSQGQLRVPWFVTNAGNILQHSKAQELSAIiLG 

CEVDADQVILSHSPMKLFSEYHEKRMLVSGQGPVMENAQGLGFR 

NWTVDEURMAFPLLDMVDLERRLKTTPLPRNDFPRIEGVLLLG 

EPVRWETSLQItlMDVLLSNGSPGAGlATPPYPHIiPVIiASNMDLrj 

WMAEAKMPRFGHGTFLLCLETIYQKVTGKBLRYEGLMGKPSILT 

IiQKATHDGAPBLGAGGTRQQQPSASQSCISILVCTGVYNPRWPO 
STEPVLGGGEPPPHGHRDLCPSPGLMEASHWNDVNBAVQLVFR 
KEGWALE 




2598 


1534 


RMCGRTSCHLPRDVLTRACAYQDRRGQQRLPEWRDPDKyCPSYjr 
KSPQSNSPVLLSRLHFEKDADSSEHIIAPMRWGLVPSWFKESDP 
SKLQPWTTNCRSDTVMBKRSFKVPLGKGRRCWIiADGPYEWQRC 
QGTKQRQPYFIYFPQIKTEKSGSIGAW3SPENWEKVWDNWRLLT 
MAGIFDCWEPPEGGDVLYSYTIITVDSCKGLSDIHHRMPAII*DG 
EEAVS KWLDFGE VSTQEALKL I HPTENI TFHAVSS WNNfiRTTKrr 

PECLAPVDLWKKELRASGSSQRMLQWLArKSPKKBDSKTPOKE 
ESDVPQWSSQPLQKSPIiPTKRGTAGLLEQWLKREKEEEPVAKRP 
YSQ 


6558 


21 


1138 


FHGRRRGGRKMELGSCLEGGREAABEBGBPEVKKRfeLLCVEFAS 

VASCDAAVAQCFIiAENDWEMERALNSYFEPPVEESALERRPETI 

SEPKTYVDLTNEBTTDSTTSKISPSEDTQQEN53SMFSLITWNID 

GLDLNNLSERARGVCSYIALYSPDVIFLQEVl'ppyySYLKKRSS 

NYEIITGHSEGYFTAIMLKKSRVKLKSQEriPFPSTKMMRNLLC 

VHV^A7SGNELCLMTSHLESTRGHAAERMNQLKMVI,KKMQEAPES 

ATVIFAGDTNLRDREVTROGGLPNNIVDVWEPLGKPKHCQYTWD 

TQMNSNLGITAACKLRFDRIPFRAAAEEGHIIPRSLDLLGLEKIi 
'ncv20l?'Dcmrijr2T t *^tt ta'ttt 
jwL.wrcr r'aun ryoijijt;;;jjfjLiDIIIi 


6559 
65^0-" 


3 


364 


GPELSGLPTRPKKLKANQTPIAMDCCASRSCSVPTGPATTXCSS 
DKSCaiCGVCLPSTCPHTVWLLEPTCCaDNCPPPCHIPQPCVPTCF 
LLNSCQPTPGLBTLNLTTPTQPCCEPCLPRGC 




3 


1435 


TATSGGIWLRRKWRCHWPRPLPQSCVGTEGGLQVRDTSSRIAKG 
GVDHTKMSLHGASGGHERSRDRRRSSDRSRDSSHERTESQLTPC 
IRNVTSPTRQHHVEREKDHSSSRPSSPRPQKASPNGSISSA£3NS 
SRNSSQSSSDGSCKTAGEMVFVYENAKEGARNIRTSERVTLIVD 
NTRFWDPS IFTAQPNTMLGRMFGSGREHNFTRPNEKGBYEVAE 
GIGSTVPRAILDYYKTGIIRCPDGISIPBLREACDYLCrSFEyS 
riKCRDLSALMHELSNDGARRQFEFYLEEMILPLMVASAQSQER 
BCHIWLTDDDWDWDEEYPPQMGEEYSQIIYSTKLYRFFKYIE 
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SEQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal peptide 
(A=Alanine, C=CVsteine, D-Aspartic Acid, E=. 
Glutamic Acid, F^Phenylalanine, G=Glycine, 
H«Histidine, I^Isoleucine, K^Lysine, 
L^Leucine, M==Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonlne, V=Valine, 
W=Tryptophan, Y=:Tyrosine, X=Unknown, *^Stop 
Codon, /=passible nucleotide deletion, 
\=pcssible nucleotide insertion) 


6561 






JTRDVAjCsvi.KERGLKKIRLGlEGYPTYKEKVKKKPGGRPBVIYN ' 
WQRPFIRMSWEKEEGKSRHVDPQCVKSKSI'mLAAAAADiPQD 
_ QI'WMHPTPQVDBLDILPIHPPSGNSDLDPDAQNPML 


6S62 


3 


1086 


PGRRFRRKESSSSRWFPADCLLGLRGPASSLLSPEPSPSWPSHS 

PCPMAALTDLSFMYRWFKNCNLVGNLSEKYVFITGCDSGFGNLL 

AKQLVDRGMQVLAACPTEEGSQKLQRDTSyRIiQTTLLDVTKSES 

IKAAAQVJVRDKVGEQGLWALVWNAGVGI,PSGPNEV7LTKDDFVKV 

INVNLVGLIBVTLHMbPMVKRARGRWNMSSSGGRVAVIGGG-^C 

VSKPGVBAFSDSIRRELYYFGVKVCIIEPGNYRTAILGKENLES 

RMRKLWSRLPQETRDSYGEDyFRIYTDKLKNIMQVAEPRVRDVI 

NSMEHAIVSRSPRIRYNPGLDAKLLYIPLAKLPTPVTDFILSRY 
JJPRPADSV f^i^-viUFXIiSRY 


6563 


1 


1S62 


WiiTbYDlRAHKAQLLRFFASSDSNKALBQRRTLHTPiCLBHIiDRV 
l.yEWPLGKRSEGVPVSGPMLIEKAKDFYEQMQL.TBPCVFSGGWL 
WRPKARHGlKiCLDASSEKQSADHQAAEQFCAFFRSlAAEHGLSA 
EQVYNADETGLFWRCLPNPTPEGGAVPGPKQGKDRLTVLMCANA 
TGSHRLKPXAIGKCSGPRAFKGIQHLPVAYKAQGNAWVDJCEIPS 
DWFHHIPVPSVREHFRTIGLPEDSKAVLLLDSSRAHPQEAELVS 
SWFTIFLPASVASLVQPMEQGIRRDFMRNPINPPVPLQGPHAR 
YWMNnAIFSVACAWNAVPSHVFRRAWRKLWPSVAPABGSSSEEE 
LEAECFPVKPHNKSPAHILELVKEGSSCPGQLRQRQAASWQVAG 
REAEGGRPPAATSPAEWWSSEKTPKADQDGRGDPGEGEEVAWB 
QAAVAFDAVLRFAERQPCFSAQEVGQLRALRAVPRSQQQVRRRR 
GALGAWKVEAI^EGPGGCGATAQSPLPCSSTAGDK 


6564~" 


1319 


2694 


liAKPAQPVLLREPEGAGPPVPAGHLVHHLQGGHbKKRAHPDLEA 
HEHPbPCDQMFWRQMGGHLRMVEAWSRGWWGIGYDHTAWVYTG 
GYGGGCFQGLASSTSNIYTQSDVKCVHIYENQRWNPVTGYTSRG 
LPTDRYMWSnASGLQECTKAGTKPPSLQWAWVSDWFVDPSVPGQ 
TDQEGMQYASDPPASYHGSKTMKDFVRRRCWARKCKLVTSGPKL 
EVPPXALRDVSI 1 PESPGABGSGHSIALWAVSDKGDVLCRLGVS 
ELWPAGSSWLHVGTDQPFASISIGACYQVWAVARDGSAFYRGSV 
YPSQPAGDCWYHIPSPPRQRLKQVSAGQTSVYALDBNGNLWYRQ 
<5TTPSYPQGSSmHVSNNVCRVSVGPhDQVWVZmKVQG3HShS 

RGTVCHRTGVQPHEPKGHGWDYGIGGGPTOHISVRANATRAPRSS 
SQEQEPSAPPEAHGPVCC 


65S5 


1 


975 


APUSCAI^WSYCWGWSRAMRGCQIiGI^RSSWPGDI^SARlZSQ^ 

KRAABTHFGFE7VSBEEKGGKVYQVFE3VAKKYDVMNDMMSLQI 

HRVWKDLLLWKMHPLPGTQLLDVAGGTGDIAPRFLNYVQSQHQR 

KQKRQLRAQQNLSWEEIAKEYQNEEDSIiGGSRVWCDINKEMriK 

VGKQKAtAQGYRAGIiAWVIiGDAEELPFDDDKFDI YTIAFG I RNV 

THIDQALQEAHRVLKPGGRFLCLEFSQVNNPLISRLYDLYSPQV 

IPVLGEVIAGDWKSYQYLVESIRRPPSQEEFKDMIEDAGFHKVT 
YESLTSGIVAIKSGFKL 


6566 " 


1464 


999 


RSAVANG LTKRRMGLKLNGR YI SIi I IiAvnTavT.voaxn3a»/-<t^r^r< — 

AVFKGFSDCLLKLGDSMANYPQGLDDKTNIKTVCTYWEDFHSCT 

VTALTDCQEGAKDMWDKLRKESKNLNIQGSLFBLCGSGNGAAGS 
OLPAFPVLLVSLSAALATWLSF 




3 


1385 

: 

( 
I 

f 
C 


nyfc:SAQPGGTQPEPGLgARMAlHKALVMCLGLPLgU.FPGAWAQG ' 

^VPPGCSQGtJJPLYYNLCDRSGAWGIVJbEAVAGAGrVTTFVLTI 

ELVASLPFVQDTKKRSLLGTQVPFLLGTLGLFCLVFACVEKPDP 

3TCASRRFLFGVLFArCFSCIiAAHVFALNFLARKNHGPRGWIF 

[VALLLTLVSVIIWTEWLIITLVRGSGEGGPQGNSSAGWAVASP 

lAIAl^n^JDFVMALIYVMrir^GAFLGAWPALCGRYKRWRKHGVFV' 

jLTTATSVAimrVWIVMYTYGNKQHNSPTWDDPTLAIALAAMAW 

IFVLFyVIPEVSQVTKSSPEQSYQGDMYPTRGVOYETXLKEQKG 

)SMFVENKAFSMDEPVAAKRPVSPYSGYNGQLLTSVYQPTBMAL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A»=Alanine, C»Cysteine, D«Aspartic Acid, E= 
Glutamic Acid, F« Phenylalanine, G^=Glycine, 
H=Histidine, I^Isoleucine, K^Lysine, 
L«Leucine, M=Methionine, N=^paragine, 
P=:Prollne, Q=G1 at amine, R=Arginine, 
S a Serine, T-Threonine, V^^Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\apossible nucleotide insertion) 








MHKVPSEGAYDIILPRATANSQVMGSANSTLRAEEWYSAQSHQA 
ATPPKDQKNSQVFRNPYVWD 


6567 


125 


863 


TKRSNLKAYACS IHH IRTMS YVFVNDSSQTNVPLLQACIDGDFN 
YSKRLLESGFDPNIRDSRGRTGLHLAAARGNVDICQLXJIKFGAD 
LLATDYQGNTALHLCGHVDTIQFLVSNGLKIDICNHQGATPLVIi 
AKRRGVNKDVIRLLESLEEQEVKGFNRGTHSKLETMQTAESESA 
MBSHSLLNPNLQQGEGVLSS FRTTWQEFVEDLGFWRVLLL I FVI 
ALLSLGIAYYVSGVLPFVENQPBLVH 


65G9 


3 


11S3 


HASDRLLVLPDNYSHPSQASANL,QGPSRTTELFHPTLAS I SS PM 
LEGAELYFKTVDHGYIjEGLVRGCKASLLO^QDYrNLVQCETLSDL 
KIHLQTTDYGNfFIANHTNPLTVSKIDTEMRKRLCGEFEYFRNHS 
LE PLS TFLTYMTCS YMI DNVI LLMNGALQKKSVKEIIiGKCHPLG 
RFTEMEAVNIAETPSDLFNAILIETPLAPFPQDCMSEKALDELN 
lELLRNKLYKS YliEAFYKFCKNHGDVTAEVMCP ILEFEADRRAF 
IITLNSFGTELSKEDRETLYPTFGKLYPEGLRLLAQAEDFDQMK 
NVABHYGVYKPLFEAVGGSGGKTI.EDVFYEREVQMNVI1AFNRQF 
HYGVFYAYVKtiKEQEIRNIVWXAECISQRHRTKINSYIPIL 


6569 


205 


1532 


RRRGPQRLGHGRPT Phh CRWRTAGPSHWEKQARAFQGLRPVDPR 
RMSWLFPLTKSASSSAAGS PGGLTSLQQQKQRL I ESLRNSHS SI 
AEIQKDVEYRLPFriNNLTININII,LPPQFPQEKPVISVYPPIR 
HHLMDKQGVY VTS PLVNNFTMHSDLGKI iQSIiLDEFWKNPPVIA 
PTSTAFP YLYSNPSQMS P YASQGFPFIiPP YPPQEANRS I TSLS V 
ADTVS SSTTSHTTAKP/\APS?GVLSNLPlJPIPTVr>AS IPTSQNG 
FGYKMPDVPDAFPE1*SELSVSQLTDMNEQEEVLLEQFLTLPQLK 
QIITDKDDI,VKSIEEIiARKNLLLEPSI.EAKRQTVLDKYELLTQM 
KSTFEKKMQRQHELS ESCSASALQARLKVAAHEAEEESDNIAED 
PLEGKMErDDFLSSFMEKRTICHCRRAKSEKLQQAIAMHSQFHA 
PL 


^570 


330 


1304 


ARIiPRLTFLREGPLYVtiLSHP/VFVGAPRPPASDSWKiCGtiVPSAP 
PASRKMGSKALPAPI PtiHPSLQbTETYSFLQAVNTFPATVDHLQG 
liYGLSAVQTMHMNHWTLGYPNVHEITRSTrTEMAAAQGI.VDARF 
PFPALPFTTHIiFHPKQGAIAHVLPALHKDRPRFDPANrAVAATQ 
EDPPKMGDLSKLSPGLGSPISGLSKLTPDRKPSRGRLPSKTKKE 
PICKFCGRHFTKSYNLLIHERTHTDERPYTCDICHKAFRRQDHI* 
RDHRYIHSKBKPFKCQECGKGPCQSRTUVVHKTLHMQTSSPTAA 
SSAAKCSGETVICGGT 


6571 


169 


656 


APDMNTlKKLQKLTDTIiTKNCKHLFRGFDXDNDGCVNVLEWIHGL 
SIiFLRGSLEEKMKYCFEVFDLNGDGFISKEE>5FHMLKNSIiLKQP 
SEEDPDEGIKDLVEITLKKMDHDHDGKLSFADYELAVREETLLL 
EAPGPCLPDPKSQMEFEAQVFKDPNE FNDM 


6572 


49 


1^46 


TPERAQPGALLGAAGCCVCGGRWWPRSHERGYPSSAKMGSKRRN 
LSCSERHQKLVDBNYCKKLHVQALKNVNSQIRNQMVQN3NDNRV 
QRKQFLRLLQNEQFEIiDMEEAIQKAEENKRLKELQLKQEEKLAM 

JSLiAJUjlUlfi &iilUJjK JtUQKQQ VKEN& I 

QXAEKDAI KYEQMKRDAEIAKTMMEEHKRI I KEENAAEDKRNKA 
KAQYYLDLEKQIjEEQEKKKQKAYEQLLKEKLMIDEIVRKIYEED 
QLEKQQKLEKMNAMRRYIEEFQKEQALWRKKKREEMEEENRKri 
EFANMQQQREEDRMAKVQENEEKRLQLQKALTQKLBEMLRQRED 
LEQVRQELYQEEQAEIYKSKLKEEAEKKLRKQKEMKQDFEEQMA 
LKELVLQAAKEEEEKFRKTMLAKFAEDDRI ELMNAQKQRMKQLE 
HRRAVEKIiIEERRQQFLADKQRELEEWQLQQRRCXSFINAIIEEE 
RLKLLKEHATNXiLGYLPKGVFKKEDDIDLLGEEFRKVYQQRSEI 
CEEK 


6573 


767 


275 


GGGGGESQSFRAQDGTRTPATDCLMYLQGPJ^kLMl'CiGGYDMVQK 
LFLDFFRRRLSQRPTAEELBQRNI LKPRNEQEEQEEKRE r KRRL 
TRKLSQRPTVBELRBRKrLXHFSDYVEVADAQDYDRRADKPWTR 
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SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A=Alanine, CoCysteine, D=Aspartic Acid, E» 
Glutamic Acid, P=Phenylalanine, G^Olycine, 
H-Histidine, I=Isoleucine, K=Lyslne, 
L^Leucine, M=Methionine, N=Asparagine, 
P^Proline, Q=Glutamine, R^^rginine, 
S^Serine, T=Threonine, V^Valine, 
W=.Tryptophan, Y=Tyrosine, X^Unknown. *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6574 
6575 


204 


1159 


IiTAADKVSRGECWRVGGRTVCWVSLGSPLGSV ' 

bESSVPVSVGVi^WACGVSWTGAAGLQDGALSDTMARNAEKAMTA 

IARFRQAQLEEGKVKEJ2RPFLASECTELPKAEKWRRQIIOBISK 

KVAQIQNAGLGEPRIRDLNDEINKLLREKGHWEVRIKEa/SGPDY 

GKVGPKMLDHBGKEVPGNRGYKYFGAAKDLPGVRELFEKEPLPP 

PRKTRAELMKAIDFEYYGYLDEDDGVIVPLBQEyEKKLRAELVB 

KWKABREARLARGEKEEEEEEEEEINIYAVTBEESDEEGSQEKG 

GDDSQQKFIAHVPVPSQQElEEAIiVRRKKMELLQKYASETLQAQ 
SEEARRLLGY 


6576 


117 


820 


5PAIASQS^mTEBKMLaPQEWGVIDtPPYEHVEDETFPPFPPP~ 

ASPERQDGEGTEPDEESGNGAPVPVPPKRTVKRNIPKLDAQRLI 
SERGLPAIiRHVPDKAKFXGKGHP'APnT^TrMT TDtjfuin>tir.T-Mtt~.x i-.-r>.» 

IK3FEDPIDRVEYLGS1CKEVQTCLKRIRU)LPILHEDPVSNNDEV 
AENNEHDVTSTELDP PLTNLSESEMFASELS I SLTEBQQQRIER 
WKQLALERRQAKLP ^wuKiKK 


6577 


1 


1060 


PEPQALVGujutUALRLLVARLVLTVSAPAEVkkk^LRPVLSWMD 
RETRAJoADSHFRGLGVDVPGVGQAPGRVAFVSEPGAFSYADFVR 
GFLLPtOLPCVPSSAFTQGWGSRRRWVTPAGRPDFDHLLRTYGDV 
WPVANCXSVQEYNSNPKEHMTLRDYITYWKEYIQAGYSSPRQCL 
YLKDWHLCRDFPVBDVFTLPVYFSSDWnNEPWDALDVBDYRFVY 
AGPAGSWSPPHADIPRSFSWSVNVCCRKKWLLFPPGOEEALRDR 
HGNLPYDVTS PALCDTHIjHPRNOIiAGPPT.T! TTnpari7Mi7i:'i7r>c>r^ 

WHHQVHNLVMCCFSCPLSGAFLQEDGSTTSPLSQPELGWNGVAH 
G 


" 6578 


2271 


987 


bUKMASDDFb.LVXEAMLEAl:>yKi<:EBDEQQRKEVKKDYPSNTT5S 

TSNSGWETSGSSTIGBTSNRSRDRDRYRRRNSRSRSPGRQCRHR 

SRSWDRRHGSESRSRDHRREDRVHYRSPPLATGYRYGHSKSPHF 

REKSPVREPVDNLSPBERDARTVFCMQLAARIRPRDLEDFFSAV 

GKVRDVRIISDRNSRRSKGIAYVEFCBIQSVPXAIGLTGQRLLG 

VPIIVQASOABKiraLAAMANm-OKGNGGPMRLYVGSLHFWITED ' 

MLRGI FEPFGKI DNIVLMKDSDTGRSKGYGF ITFSDS ECARRAL 

BQLNGFELAGRPMRVGHVTERLDGGTDITFPDGDQELDLGSAGG 

RFQLMAKLAEGAGIQLPSTAAAAAAAAAAQAAALQLNGAVPLGA 

LNPAALTALSPALNLASOCLQLSSLPTPQTM 


6S79 " 


377 


1489 


PSSSATMNRAi'iiKKATILHMAIiTGASDPSAEAEANGEKPFLLRA 
LOIALWSLYWVTSISMVFLNKYLLDSPSLRLDTPIFVTFYQCL 
VTTLLCKGLSALAACCPGAVDFPSLRLDLRVARSVLPLSWFIG 
MITFNNLCLKYVGVAFYNVGRSLTTVFNVLLSYLLLKQTTSFYA 
LLTCGI I IGGPWLGVDQEGAEGTLSWLGTVPGVLASLCVSLKAI 
YTrrKVLPAVDGSIWRLTFYNNVNACILFLPLLLLLGBLQALRDP 
AQLGSAHPMGMMTLGGLFGFAIGYVTGLQIKFTS PLTHNVSGTA 

KACAQTVLAVLYYBETKSFLWWTSNNMVIiGGSSAYtWVRGWEMK 
KTPEEPSPKDSEKSAMGV 


6580 


2 


711 


Ki^PKVWYPBLi<iSJLSAAAPRW$HRTAPGIMVPYFTSSSVNSSA?F~ 

X i ravjRij A. I iirgtuiji KHGWPEDI WFH VDKLSSAHVYLRLHKGEK I 

EDIPKEVIJ^DCT^VKANSIQGCKMNNVlfVVYTPWSNLKKT^ 

DVGQIGFHRQKDVKIVTVEKKVNEILNRLEKTKVERFPDLAAEK 

ECRDREERNEKKAQIQEMKKREKEEMKKKREMDELRSYSSLMKV 
BNMSSNQDGNDSDBFM 




62 


1571 ] 
1 

: 

J 
C 


LiVAIiKNWKPKGTNl PAPQSPVFGEAVSGVYMMTKVLGMAPVflS?- 

iPPQEC2VGPLMVKVEEKEEKGKYLPSLEMPRQRFRQFGYHDTP6 

'REALSQLRVLCCEPJLRPEIHTKEQILELLVLEQFLTILPQBLQ 

WQEHCPESAEEAVTLLEDLBRELDEPGHQVSTPPNEQKPVWE 

CrSSSGTAKESPSSMQPQPLETSHKYESWGPLYIQESGEEQEFA 

JDPRKVRDCRLSTQHEESADEQKGSEAEGLKGDI IS VI lANKPE 

^LERQCVWIiENEKGTKPPIiQEAGSKKGRESVPTKPTPGERRYl 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleoticle 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
seqfuenoe 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid. E= 
Glutamic Acid, F^^Phenyl alanine, G=Glycine, 
H=Histidine, I^slsoleucine, K^Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y^iyrosine, X«Unknown, ♦-Stop 
Codon, /^possible nucleotide deletion, 
_ \=po33ible nucleotide insertion) 


6581 






CAECGKAFSNSSNI,TKHRRTHTGEKPYVCTKCX3KAFSHSSKLTL 

HYRTHLVDRPYDCKCGKAFGQSSDLLKHQRMHTBEAPYQCKDCG 

KAPSGKGSLIRHYRIHTGEKPYQCNECX5KSFSQHAGLSSHQRLH 

TGEKPYKCKECGKAPNHSSNFNKHHRIHTGBKPYWCHHCGKTFC 
SKSNLSKHQRVHTGEGEAP 




228 


476 


RVFLKDLSaTPMASNNTASIAQARKI,VEQLKMEANIDRIKVSKA 
AADLMAYCBAHAKEDPLDTPVPASENPFRBKKFFCAIL 


6582 
6583 


1428 


! 718 


CFTTKTHCSPVSVPYLSPLVLRKELESLLENEGDQVIHTSSFjlf 
QHPIIFWTIiVWYFRRLDLPSNLPGLILTSEHCNEGVQLPLSSLS 
QDSKLVYIQLLWDNIWLHQEPREPLYVSWRNFNSEKKSSLLSEE 
QQETSTLVETIRQSIQHNNVLKPINLLSQQMKPGMKRQRSLYRE 
ILPLSLVSLGRENIDIEAFDNEYGIAYNSLSSEILERLQKIDAP 
PSASVEWCRKCFQAPLI 


6584 


487 


41 


KIFSMTSGKiiKWRCTWRPATALWSASBRLGTSSMHPSPRSIsE?^ 

LSMMLSPIiPSNTRGLSPTALFRSPDSEHATSCPRLHLWRCRAPL 

R5FSPLGRLQVLPRSPLHVHTHNSGKEVLGLQVQRSRSGTGPAC 
SQAGSGAVQGGNWCI F 


6585 


189 


1750 


F£.PMAALGPssgNVTEYVVRVPKNTTKKYNIMAFNAADKVNFAT 
WNQARLERDLSNKKIYQBBEMPESGAGSBFKRKLREEARRKKYG 
IVLKEFRPEDQPWLLRVNGKSGRKFKGIKKGGVTENTSYYIFTQ 
CPDGAFEAFPVHNWYNFTPLARHRTLTAEBAEEEWERRNKVLNH 
FSIMQQRRLKDQDQDEOEBEKEKRGRRKASELRIHDLEDDLEMS 
SDASDASGEEGGRVPKAKKKAPLAKGGRKKKKKKGSDDEAFEDS 
DDGDFSGQEVDYMSDGSSSSQEEPESKAKAPQQEEGPKGVDEQS 
DSSEESEEEKPPEEDKEEEEEKKAPTPQEKKRRKDSSEESDSSE 
ESDIDSEASSAFFMAKKKTPPKRERKPSGGSSRGNSRPGTPSAE 
GGSTSSTLRAAASKLEQGKRVSEMPAAKRLRLDTGPQSLSGKST 
PQPPSGECTTPNSGDVQVTEDAVRRYLTRKPMTTKDLLKKFQTKK 
TQLSSEQTVNVLAQILKRtOTERKMINDKMHFSLKE 


6536 


3 


1678 


^PJRIfSRIDiJi-VGGDPRAEASCSVLHSKPHAMADSRDPASDOMQ 1 

HWKEQRAAQKADVr.TTGAGNPVGDKLNVITVGPRGPLLVQDWF 

TDEMAHFDRBRIPERWHAKGAGAFGYFEVTHDITKYSKAKVPE 

HIGKKTPIAVRFSrVAGBSGSADTVRDPRGFAVKFYTEDGNWDL 

VGNNTPIPFIRDPrLPPSFIHSQKRNPQTHLKDPDMVWDFWSriR 

PESLHQVSFLFSDRGIPDOHRHMNOYGSHTFKLVNAM3BAVYCK 

FHYKTDQGIKNLSVEDAARLSQEDPDYGIRDLFMAIATGKYPSW 

TFYIQVMTFlfQAETFPFNPFDIiTKVWPHKDYPLIPVGKLVUIRN 

PVNYFAEVEQ XAPDPSWMPPG IKAS PDKMLQGRLPAYPDTHRHR 

LGPNYLHIPVNCPYRARVATJYQRDGPMCMQDWQGGAPMYYPNSP 

GAPEQQPSALEHS IQYSGB VRRFNTANDDNVTQVRAFYVNVI^E 

EQRKRLCENIAGHLKnAQIPIQKKAVKNFTEVHPDYGSHIQALl. 

DKYNAEKPKNAIHTPVQSGSHLAAREKANL 


^587 " 


32 


804 


PLPEQPASSTSTMPVSGTPAPNKKRKSSKLIMBLTGGGQESSGL 
NIKjKKISVPRDVMIiEELSLLTNRGSKMFICLRONf»vtrtfirTvi7ATiJT5 

DVFSDSSMDHFQKFLPTVGGQLGTAGQGFSYSKSNGRGGSQAGG 
SGSAGQYGSDQQHHLGSGSGAGGTGGPAGQAGRGGAAGTAGVGE 
TGSGDQAGGEGKHITVPKTYISPWERAMGVDPQQKMELGIDLIA 
YGAKAEiPKYKSFNRTAMPYGGYEKASKRMTFQMPKV 




75 


1117 

] 
] 

: 
I 
1 


RRVPSLGKMPECWDGEHDIETPYGLLHWIRGSPKGWRPAliiTY" 

SDVGLNHKLCFNTFFNFBDMQEITKHFWCHVDAPGQQVGASQF 

PQGYQFPSMEQLAAMLPSWQHFGFKYVIGIGVGAGAYVLAKPA 

LjIFPDLVEGLVLVNIDPNGKGWIDWAATKLSGLTSTLPDTVLSH 

[iFSQEELVNNTELVQSYRQQIGNWNQANLQLPWNMYNSRRDLD 

CNRPGTVTNAKTLRCPVMLVVGDNAPAEDGVVECNSKLDPTTTr 

?LKMADSGGLPC3VTQ?GKLTEAPKYPLQGMGYMPSASMTRIARS 

ITASLTSASSVDGSRPQACTHSESSBQLGQVNHTMEVSC 
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SSQ 
ID 
WO: 

6588 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal oeptide 
(AoAlanxne, C-Cysteine, D^partic Acid, E== 
Glutamic Acid, P«.Phenyl alanine, G=Glycine 
H«Hietidine, I-Isoleucine , K^Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P«Proline, Q=Glutamine, Ri=Arginine, 
S=Serine, T« Threonine, V=Valine, 
W«rrypcophan, Y=»ryrosine, X^Unknovn, *-Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6589 


13 7 


501 


IlGLQAQI,Ll5:JRT^^KyQLSDELRKNGVELTSLRQKYAYLDKEFSK 
AQKALSKSKKAQEVEVLLSENEMLQAKLHSQEEDFRLQNSTLMA 
EFSKLCSQMEQLEQENC?QLKEGAAGAGVAQAGP 


6590 


2 


1405 


KPWGSAMATKSRQEFFQQLtQdeLLPTAQQGLDQIWLLLAICLA 
CRLLWRLGLPSYLKHASTVAGGFFSLYHPFQLHMVWWLI^LLC 
YLVLFLGRHSSHRGVFLSVTILIYLLMGEMHMVDTVTWHKMRGA 
QMIVA^3KAVSLGFI>LDRGEVGTVPSPVEFMGYLYFVGTIVFGPW 
ISPHSYLQAVQGRPriSCRWLQKVARStAlALLCLVI,STCVGPYL 
FPYFIPLNGDRLLRNKKRKARGTMVRWLRAYESAVSFHKSNYFV 
GFLSEATATLAGAGFTEEKDHLEWDLTVSKPLKrVEI.PRSMVEW 
TSWNLPMSYWLNNYVFKNALRLGTFSAVLVTYAASALLHGPSPH 
LAAVLLSrAPITYVEHVLRKRLARILSACVLSKRCPPDCSHQHR 
LGLGVRAIJTLLFGAIiAIFHLAYLGSLFDVDVDDTTEEQQYGMAY 
TVHKWSELSWASHWVTPGCKIFYRLIG 


6591 


2177 


656 


VRAYSHVXiSljLBNVFTPMFCHRDEYFRQLLRGAESPTRNSKIiNR 
GSLSLDDFRNTQKRGESFGISRIGSKIKGVFKSTTMEGAMLPNY 
GVAEGBDDFIEEGIWMEDDSPVEAVSTPWTPRNLAAWKISIPY 
VDFFEDPSSERXBKKERIPVFCIDVERNDRRAVGHEPEHWSVYR 
HYLEPYVLESKLTEFHGAPPDAQLPSKRIIGPKETYEFLKSKREE 
FQEYLQKLLQHPELSNSQLLADPLSPNGGETQPLDKILPDVNLG 
KIIKSVPGKLMKEKGQHLEPFIMNPINSCESPKPKPSRPELTIL 
SPTSEl^KKLFNDLFKNNANRAENTERKQNQITYFMEVMTVEGVY 
DYLMYVGRWFQVPDWLHHLLMGTRILFKNTLEMYTDYYLOCKL 
EQLFQEHRLVSLITLLRnAlFCENTEPRSLQDKQKGAKQTFEEM 
MNYIPDLLVKCIGEETKYESIRLLPDGLQQPVLNKQLTYVLLDI 
VI QELFPELNKVQKEVTS VTS WM 


6592 


2177 


656 


VRAYEHVLSH.ENVFTPMFaiRDEYFRQX,LRGAE5PTRNSKIJ?R" 

GSLSLDDFRHTQKRGBSFGISRIGSKIKGVFKSTTMEGAMLPNY 

GVAEGEDDFIEEGIWMEDDSPVEAVSTPNTPRNLAAWKISIPY 

VDFFEDPSSBRKEKKERIPVFCIDVERNDRRAVGHEPBHWSVYR 

RYLEPYVLESKLTEFHGAFPDAQLPSKRIIGPKNYEFLKSKREE 

FQEYLQKLLQHPBLSNSQLLATFLSPNGGETQFI^KILPDVNLG 

KIIKSVPGICLMKEKGQHLEPFIMNFINSCESPKPKPSRPELTIL 

SPTSENNKKLFmSLPKNKANRAENTERKQNQNYFMEVMTVEGVY 

DYLMYVGRWFQVPDWLHHLLMGTRriiFKNTLEMYTDYYLQCKL 

EQLFQEHRLVSLXTr.LRnAIPCENTEPRSLQDKQKGAKQTFEEM 

MNYIPDLLVKClGEETKYESIRLLFDGIiQQPVtiNKQLTYVLLDI 

VIQELFPELNKVQKEVTSVTSWM 


6593 


3 


1861 

; 


APEPLGSTrS&GSMIDANLKLiiQKAEQRIiKAIVAEKFAIAT'KEG 
DLPQVERFFKIFPLLGLHEEGLRKFSEYLCKQVASKAEENLLMV 
LGTDMSDRRAAVI FADTLTLLFEGIARXVETHQPIVETYYGPGR 
LYTLIKYLQVECDRQVEKWDKPIKQRDYHQQFRHVQNNLMRWS 
TTEKIEPRELDPILTEVTLMNARSELYLRFLKKRISSDFEVGDS 
MASEBVKQEHQKCLDKLtiNNCLLSGTMOELIGLYVTMPPVPMor. 

TVNKAVAIiDTYEKGQLTSSMVDDVFYIVKKCIGRALSSSSIDCL 

CAMINLATTELESDFRDVLCNKLRMGFPATTFQDIQRGVTSAVN 

IMHSSLQQGKPDTKGIESTDEAKMS FLVTLNNVE VCSENISTLK 

KTLESDCTKLFSQGIGGEQAQAKFDSCLSDLAAVSKrKFRDLLQE 

SLTELNSTAIKPQVQPWINSFFSVSHWIEEEEPNDYEANDPWVQ 

^FILWLEQQMAEFKASLSPVrYDSLTGLMTSLVAVELEKWLKS 

rFNRLGGLQPDKEIiRSLlAYLTTVTTWTIRDKFARLSQMATILN 

^ERVTEILDYWGPNSGPLTWRLTPABVRQVIiALRIDFRSEDIKR 
jRL 




3 


1837 I 

I 


lAFSAGSRRRGLALQRGVLGGLGGYCPCCCRRRGRLLVtLLLVR 
IGGEGGGGRGRGDKRRRRQARRQRRRPEPAEARGGKMADVLSVL 
.QYNIQKKEI\AaCGDEVIFGEFSWPKNVKTNyWWGTGKEGOPR 
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SEQ 
ID 
WO: 


Predicted ' 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aepartic Acid, E== 
Glutamic Acid, F«Phenylal anine , G=aiycinei 
H^Histidine, I»Isoleucine, K^Lysine, 
L«Leucine, MaMethionine, N=Asparagine , 
P=:Proline, Q=Glutamine, R^^Arginine, 
SaSerine, T=Threonine, V^valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possibls nucleotide insertion) 








EYYTLDSILFLLNir/HLSHPVYVRRAATENIPVVRRPDRKDLLG 
YLNGEASTSASIDRSAPLEIGLQRSTQVKRAADEVliAEAKKPRI 
EDEECVRLDKERLAARLEGHKEGIVQTEQIRSLSEAMSVEKIAA 
IKAKIMAKKRSTIKTDLDDDITAIiKQRSFVDAEVDVTRDIVSRE 
RWRTRTTILQSTGKNFSKNI FAI UJSVKAREEGRAPEQRPAPN 
AAPVDPTLRTKQPlPAAYNRYDQERPKGKEETEGFKIDrMGTYH 
GMTLKSVTEGASARKTQTPAAQPVPRPVSQARPPPNQKKGSRTP 
1 1 IIPAATTStilTMLNAKDLLQDLKFVPSDEKKKQGCQRENETL 
IQRRKDQMQPGGTAISVTVPYRWDQPLKLMPQDWDRWAVFVQ 
GPAWQFKGWPWLLPDGSPVDIFAKIKAFHLKYDEVRLDPJXVQKW 
DVTVLELSYHKRHLDRPVFLRVWETLDRYMVKHKSHLRF 


6594 


1 


1096 


EFPGRRFRGSQASPLCATCGPALLRAPTRAAMTRSLFKGNFWSA 
DILSTIGYDNriQHLNNGRKNCKBFEDFLKERAAIEERYGKDLL 
WLSRKKPCGQSEINTLKRALEVPKQQVDNVAQCHIQIAQSLREE 
ARKMBBFREKQKLQRKKTELIMDAIHKQKSLQFKKTMDAKKNYE 
QKCRDKDEAEQAVSRSANLVNPKQQEKLFVKIiATSKTAVEDSDK 
AYMLHIGTLDKVREEWQSEHIKACEAFEAQECSRINFFRKALWL 
HVNQLSQQCVTSDEMYEQVRKSLEMCS IQRDI E YFVNQRKTGQ I 
PPAPIMYENFYSSQKNAVPAGKATGPNLARRGPLPIPKSSPDDP 
NYSLVDDYSLI,YQ 


6595 


57 


781 


PLGTMSDSDLGEDEGLLSLAGKRKRRGNLPKESVKILRDWLYXH 
RYNAYPSEQEKLSLSGQTNLSVLQICNWFlNARRRIiLPDMLRKD 
GKDPNQFTI SRRGGKASDVALPRGSS PSVLAVSVPAPTNVLSLS 
VCSMPLHSQQGEKPAAPFPRGELESPKPLVTPGSTLTLIiTRAEA 
GSPTGGLFNTPPPTPPEQDKEDFSSFQLLVEVALQRAAEMELQK 
QQDPSLPLLHTP IPLVSENPQ 


6596 


2 


1026 


PRLPVRRYHGHRRLQGRSRGHMAEGDAGSDQRQNESIEAMAAIY 
GEEWCVIDDCAKIFCIRISDDIDDPKWTLCLQVMLPNEYPGTAP 
PIYQLNAPWLKGQERADLSNSLEEIYIQNlGESIIiYLWVEKIRD 
VLIQKSQMTEPGPDVKKKTSEEDVECSDDLIItACQPESSVKALD 
PDI SETRTEVE VEELPP IDHGIP ITDRRSTFQAHLAPWCPKQV 
KMVLS KLYENKKIASATHNI YAYR I YCEDKQTFLQDCEDDGETA 
AGGRLIJHLMEILNVK^mlVVVSRWYGGIUJGPDRFKHINNCARN 
r LVEKNYTNS PEESSKALG KNKK VRKDKKRNEH 


6597 


2 


1026 


PRLPVRRYHGRRRLQGRSRGHMAEGDAGSDQRQNEEIEAMAAIY 
GEEWCVIDDCAKIPCIRISDDIDDPKWTLCLQVMLPNEYPGTAP 
PI YQLNAPWLKGQER7U5LSNSLEEI YIQNIGES ILYLWVEKI RD 
VL 1 QKSQMTBPGPDVKKKTEEBDVECEDDLILACQPESS VKALD 
FDI SETRTEVEVEEIiPP IDHGI PITDRRSTFQAHIiAP WCPEQV 
KMVLS KLYENKKIASATHNIYAYRIYCEDKQTFLQDCEDDGETA 
AGGRLLHLMEILWKNVMVVVSRWYGGILLGPDRFKHINNCARN 
ILVEKNYTNSPEESSKALGKNTKKVRKDKKRNEH 


6598 


1099 


419 


PRVRWATTMAMSFEWPWQYRFPPPFTLQPNVDTRQKQIAAWCSL 

LRKKGNLEWLDKSKSSFLIMWRRPEEWGKLIYQWVSRSGQNKSV 
FTLYELTNGEDTEDEEFHGLDEATLLRALQALQQEHKAE I ITVS 
DGPRRQVUAGTCIiPLLLTSHLSRAFKRRQTQCPPKTGSVTPPD 
SKGLQS 


6599 


164 


1593 


KMAALTTLPKYIDEWQDRYIKKLAKWVArQSVSAWPBKRGEIRR 
MMEVAAADVKQLGGSVELVDIGKQKLPDGSEIPLPPILIiGRLGS 
DPQKKTVCIYGHLDVQPAALEDGWDSEPFTIiVERDGKLHGRGST 
DDKGPVAG WINALEAYQKTGQE I PVNVRFCIiEGMBESGSEGLDE 
LIFARKDTFFKDVDYVCISDNYWLGKKKPCirYGLRGICYFFIE 
VECSNKDLHSGVYGGSVHEAMTDLILLMGSLVDKRGNILIPGIN 
EAVAAVTEEBHKI*YDDIDFDIEEFAKDVGAQILI.HSHKKDrLMH 
RWRYPSLSLHGIEGAPSGSGAKTVIPRKWGKPSIRLVPNMTPE 



52g 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


Fredictecl 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


1 Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=.Cysteine, D=Aspartic Acid, £= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H=:Histidine, I=Isoleucine, K=Lysine, 
LssLeucine, M^Methionine, N=Asparagine, 
P«Proline, Q=Glutatnine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 








WGEQVTSYLTKKFAELRSPNBFKVYMGHGGKPWVSDFSHPHYL 
AGRRAMKTVFGVEPDLTREGGSIPVTLTFQEATGKNVMLLPVGS 
ADPGAHSQNEKLNRYmflEGTKMLAAYLYBVSQLKD 


6600 


2 


934 


PGRIiFRVAAMESAGLEQLLRELLLPDTERI RRATEQLQ I VLRAP 

AALSALCDIiLASAADPQIRQFAAVLTRRRLNTRWRRLAAEQRES 

LKSLILTALQRETEHCVSLSLAQLSATIFRKEGLEAWPQLLQLL 

QHSTHSPHSPEREMGLLLLSVWTSRPEAFQPHHRELLRLLNET 

LGEVGSPGLLPYSLRTLTTMAPYLSTEDVPLARMLVPKlilMAMQ 

TLIPIDEAKACSALBALDELLE5EVPVITPYLSEVLTFCLEVAR 

I!A/AIX5NAIRIRILCCLTFLVKVKSKAI>LICNRL1ATLAAHP?PHC 
GC 


6601 


529 


1420 


PRAAARAPPPAVLRRDRRAATAPGAGEMTLHOPLAQRYFLNHIE 
KITTWQDPRKAMNQPLNHKINIiHPAVSSrPVPQRSMAVSQPlTLVM 
NHQHQQQMAPSTLSQQNHPTQNPPAGLMSKPNAIiTTQQQQQQKL 
RLQRIQMERERIRMRQEELMRQEAALCRQIiPMEAETIiAPVQAAV 
NP PTMTPDMRS I TNNSSDPFIiNGG P YHSREQSTDSGLGL6CYSV 
PTTPEDFLSNVDEMDTGENAGQ-I'PMNINPQQTRFPDFLDCLPGT 
NVDLGTLESEDL I PLFNDVESALNKSE PPLTWL 


6602 


127 


617 


IJIjDFPALPKFVIAQSPKT^KPSTMTSMTQSI^REVIKAKTKARNP 

ervlgkitlvsaapgkvicemkvbbehtnaigtlhggltatiivd 
nistmallctergapqvsvdmnitymspaklgedivitahvlkq 

GKTLAFTSVDLTNKATGKLIAQGRHTKHIiGN 


6603 

cd.e\A 


79 


660 


PVGPSSLAARTGLGHLPFLHRIiASSRGLDMDLUJPLAFr^FVLLL 
SGMGATGTLRTSLDPSLBIYKKMFEVKRRBQLIJUjKNLAQLNDI 
HQQYKILDVMLKGLFKVLEDSRTVLTAADVLPDGPFPQDEKLKD 
AFSHWENTAPFGDWIiRPPRIVHYYPDHNSNWNLLIRWGISFC 
NQTGVPNQGPHSPILSLM 


o ou4 


3 


688 


TSTAQRQGGERMSFRGGGRGGFNRGaGGGGFNRGGSSNHFRGG~ 
GGGGGGNFRGGGRGGFGRGGGRQGFNKGQDQGPPERWLLGEFL 
HPCEDDIVCKCTTDENKVP YFNAPVYLENKEQIGKVDEI FGQLR 
DFYFSVKLSEKMKASSFKKLQKFYIDPYKriLPLQRFLPRPPGEK 

GPPRGGGRGGRGGGRGGGGRGGGRGGGFRGGRGGGGGGFRCGRG 
GGFRGRGH 


6605 


7 


848 


SGSRRGA:iRAAGVGLVDCHCm.SAPDPDRDi:j3DVLEKAKKANVV 
ALVAVAEHSGEFEKIMQLSERYNGFVLPCLGVHPVQGLPPEDQR 
SVTLKDLDVALPI lENYKDRLLAIGEVGLDFSPRPAGTGBQKEB 
QRQVLIRQIQLAKRLNLPVlTVHSRSAGRPTINIiLQEQGAEKVLL 
HAFDGRPSVAMBGVRAGYFFS I PPS I iKSGOQiCLVKQIiPLTS IC 
LETDSPALGPBKQVRNEPWNISXSAEYIAQVKGISVEEVIEVTT 
QNTALKLFPKXRHLLQK 


66 06 


2 


1682 


FVEIRPRAEVANLSAHSASPIODAVLKRLSLLEDIVYRQLNGLS"" 

KSLGLIEGYGGRGKGGIiPATLSPAEBEKAKGPHEKYGYNSYLSE 

KISLDRSIPDYRPTKCKELKYSKDLPQISirPIFVNEAtiSVILR 

SVHSAVWHTPTHLLKSIILVDDNSDEEEIiKTPLEEYVHKRYPGL 

VKWRWQKREGI. IRARI EGWKVATGQVTGFFDAHVEFTAGWAE P 

V1,SRI QEKTRKRVI LPS IDNIKQDNFEVQRYEWSAHGYSWELWCM 

YISPPKDWWDAGDPSLPIRTPAMIGCSFWNRKPFGEIGLLDPG 

MDVYGGENXELGIKVWtiCXSGSMEVLPCSRVAHIBRKKKPYKSNI 

GFYTKRNALRVAEVWMDDYKSHVYrAWWTLPLENPGlDIGDVSER 

RALRKSLKCKNFQWYLDHVYPEMRRYNNTVAYGEtiRNNKAKDVC 

LDQGPLEWHTAILYPCHGtraPQIARYTKEGFLHLGTU^TTTIiLP 

DTRCLVDNSKSRLPQLLETCDKVKSSLYKRWWFIQNGAIMNKGTG 

RCLEVENRGLAGIDLILRSCTGQRWTIKNSIK 


6607 


137 


98^ 

( 


VPACAGIiKKSARS LLASPPRLLNTIOjCSASCRALFS PPIQSRQTT 

3ispqgrggagpgvptrtqvfaamqavmqtfsslqtkqrrpskd 
eciedelemtmvchrpegleqleaqtnftkrelqvlyrgfknecp 



529 



wo 01/53312 



PCT/US00/342(;3 



SEQ 
ID 
NO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
auiJino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide" 
{Ar'Alanine, C=^Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanlne, GsGlyclne, 
H=Histidine, l=lsoleucine, K^Lysine, 
LxLeucine, M=Methionine , N=Asparaglne, 
P=Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threonine, V»Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown, *^=Stop 
Codon, /opossible nucleotide deletion, 
\«possible nucleotide insertion) 


eeo8 






SGWNEDTtKQlYAQFFPHGDASTYAHiTLPNAFDTTQTGSViCPr" 
DFVTAIiSILLRGTVHEKLRWTFNLYDINKDGYINQEEMMDIVKA 
iyDMMGKYTYPVXJCEDTPRQHVDVPFQKMDKNKDGIVTl.DEFl.E 
SCQEDDNID4RSLC2LFQNVM 


6609 


224 


1140 


RPCFSSPTQijjCPRLSYPMIIiLQHAVLPPPKQPSPSPPMSVATRS 
TGTLQLPPQKPFGQEASLPIAGEEELSKGGEQDCALEELCKPLY 
CKLOTVTLNSAQQAQAHYQGKNHGKKLRWYYAANSCPPPARMSN 
WHPAATPWPVPPQMGSFKPGGRVILATENDYCKLCDASPSSP 
AVAQAHYQGKWHAKRLRLAEAQSNSFSESSELGQRRARKEGNEF 
KMMPNRRNMYTVQNNSGPYFNPRSRQRIPRDLAMCVTPSGQFYC 
SMCNVGAGEEMEFROHLESKQHKSKVSEQRYRNEMENLGYV 


6510 


1 


443 


FRLRCRRPRVAGGRLAQAGLRESRVPAPEQRLSALTLLSWSAVT 
PAAEPGNFQLSPAEPRGPLASPVRAAPRAPCPAAEMSELNTKTS 
PATNQAAGQEEKGKAGNVKKAEEEEEIDIDLTAPETEKAAIAIO 
GKFRRFQKRKKDPSS 


6S11 


319 


881 


GRKSLCNLHIFIRFPLTYPDMYMGMMCTAKKCGIRFQPPAIILI 

YESEIKGKIRQRIMPVRNPSKFSDCTRAAEQLKNNPRHKSYLEQ 

VSLRQLEKLFSFLRGYLSGQSLAETMEQIQRETTIDPEBDLNKL 

DDKBLAKRKSIMDELFEKNQKKKDDPNFVYDIEVEFPQDDQLOS 
CGWDTESADBP *rfuui;yi.y£> 


6512 


978 


212 


PGCSGAGSRVWWLPALRHLAMGSTBSSEGRRVSPGVDEEERVRV 
LQGVRLSENWNRMKEPSSPPPAPTSSTFGKJDGNrjRAPHKEST 
LPRSGSSfSGQQPSGMKEGVKRYEQEHAAIQDKLFQVAKREREAA 
TKHSKASLPTGEGSISHEEQKSVRLARELESREAELRRRDTPYK 
EQLERIERKNAEMYKLSSBQFHEAflSKMESTIKPRRVEPVCSGL 
QAQILHCYRDRPHEVLLCSDLVKAYQRCVSAAHKG 


6^13 " 


1724 


992 


VSTHASALSRTQGQPQRQPRAAASGAGAGTAGGGGSG6AEGSKM 
STEAQRVDDSPSTSGGSSDGDQRESVQQEPEREQVQPKKKEGKI 
SSKTAAKLSTSAKRIQKEIiAEITIiDPPPNCSAGPKGD^aYEWRS 
TILGPPGSVYEGGVPFLDITFSPDYPFKPPKVTFRTRIVHCNIN 
SQGVICLDILKDNWSPALTISKVLliSICSLLTDCNPADPLVGSI 
ATQYMTNRAEHDRMARQWTKRYAT 


6614 


130 


74 8 


b'bELSSNMPiigSNDyRVAVFGAGGVGKSSLVUlFVKGTFRESYI 
PTVEDTYRQVlSCDKSrCTLQITDTTCSHQFPAMQRLSISKGKA 
PILVYSITSRQSLEELKPIYEQICEIKGDV3SIPIMLVGNKCDE 
SPSREVQSSEARALARTWKCAFMETSAKLNHNVKELFQELLNLE 
KRRTVSLQIDGKKSKQQKRKEKLKGKCVIM 


6615 


3 


1191 


SSAAEAMRVLVRRCWGPPIiAHGARRGRPSPQWRAIARLGWEbCR 
DSRVREKPPWRVLPFGTDQFAREALRALHAARENKEEELIDKLE 
WTMPSPSPKGIiPVKQYAVQSQLPVYEWPDVGSGEYDVGWASF 
GRLLNEALILKPPYGILNVHPSCLPRWRGPAPVIHTVI.HGDTVT 
GVTIMQIRPKRFDVGPILKQETVPVPPKSTAKELEAVLSRLGAN 
ML IS VLKKL PESLSNGROOPMEGATVa P V T c ar"PQ r" T vMr-TTz-im « 

EQIFRLYRAIGNIIPLQTLWMANTIKLUDLVEVNSSVLADPKLT 
GQALIPGSVIYHKQSQILLVYCKDGWIGVRSVMLKKSLTATDFY 
NGYLHPWYQKNSQAQPSQCRPQTLRLPTKKKQKKTVAMQQCIE 


66a6 


832 


35 

1 


GRVQAGASAMSELPGDVRAFLREHPSIiRLQTDARKVRCiLTGHE 

LPCRLPBLQVYTRGKKYQRLVRASPAPDYAEFEPHIVPSTKNPH 

QLFCKLTLRHINKCPEHVLRHTQGRRYQRALCKYBEOQKQGVEY 

VPACLVHRRRRRBDQMDGDGPRPRBAPWEPTSSDBGGAASDDSM 

rDLYPPELFTRKDLGSrBDGDGTDDPXTDKEDEKAKPPREKATD 

3GRRETTVYRGLVQKRGKKQLGSLKKKFKSHHRKPKSFSSCKQS 
J 




347 


1886 ] 

1 


jLPPCQGARPLSSPPHASEDNLFLFWNCILCAFPHPSPQPLQYP 
mPLLLVITQiPAPRHLRNRPPSFSRGGLDSFSGSLSTPSICRS 



530 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


j Preolicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acxd segment containing signal peptide 
(A=Alanine, C=Cysteine, D^Aspartic Acid, E^^ 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=IsQleucine, KsLysine, 
L=Leucine, Mr^Methionine, K=Asparagine , 
P=Proline; Q^Glutamine, R-Arginine, 
S=sSerine, Ti=Threonine , V«Valine, 

jr * — lyjTosine, A=unJcnown, *=»Stop 
Codon, /^possible nucleotide deletion, 
_ \=pog5ible nucleotide insert ion) 


6617 






PAWVKMAPWi-PKGLVPAVLWGLSLFLNLPGPIWLQPSPPPQSSP 
PPQPHPCHTCRGLVDSFNKGLERTIRDNPGGGNTAWEEENLSKY 
^w^e. J. i^ii V a vjjot^ vc&Ki>DJf JiCHRLliBIiSEELVES WWFHKQOEA 
PDLPQWLCSDSLKLCCPAGTFGPSCLPCPGGTERPCGGYGQCEG 
EGTRGGSGHCDCQAGYGGEACGQCGLGYFEAERNASHLVCSACF 
GPCARCSGPEESNCLQCKKGWALHHLKCVDIDECGTBGANCGAD 
Uf '-vw iiiGSXECRDCAKACLGCMGAGPGRCKKCSPGYQQVGSKC 
LDVDECETEVCPGENKQCENTEGGYRCICAEGYKQMEGICVKEQ 
IPESAGFFSEMTEDELWLQQMFFGIIICALATLAAKGDLVFTA 
IPIGAVAAMTGYWLSERSDRVLEGPIKGR 


6618 


118 


673 


VWMAWQVSLIiELEDRLQCPICLEVFKESLMLQCfGHSYCKGCLVS 

LSYHLDTKVRCPMCWQAVDGSSSLPNVSIjAWVIEALRLPGDPEP 

KVCVHHRNPLSLFCEKDQELICGLCGI.LGSHQHHPVTPISTVCS 

RMKSEIiAALFSELKQEQKKVDELIAKLVKNRTRIDGSAPSLCPC 
LGPATFTFL 




b4 8 


136 


DGKVARRAPJ>iSPAFQNDIYPLVSAPRATlAKSPWSKVLQNTQCR ~ 

NVPKMTSERSRIPCLSAAAAEGTGKKQQEGRAMATLDRKVPSPE 

APLGKPWSSWrnAAKLHCSDNVDLEEAGKEGGKSREVMRLNKEA 
WKYGT 


6619 


246 


842 


PASSEVLTAAVMFLLLNCIVAVSONMGIGKNGDLPRPPliRNEFR 
YFQRMTTTSSVEGKQNIiVIMGRKTWFSIPEKNRPLKCRINLVLS 
RELKEPPQGAHFIARSLDnALKLTERPELAWKVDMIWIVGGSSV 
YKEAMNHLGHLKLFVTRIMQDFESDTPFSEIDLEKYKLLPEyPG 
ILSDVQEGKHIKYKFEVCEKDD 


6620 


3 


1879 


nSRVDDFVARARMAAENEASQE3ALGAY3PVDYMSITSFPRLPE 
DEPAPAAPIiRGRKDEDAFLGDPDTDPDSPLKSARLQRLPSSSSE 
MGSQDGSPLRETRKDPFSAAAAECSGRQDGLTVIVTACLrFATG 
VTVALVMQIYFGDPQXFQQGAWTDAARCTSLGIEVLSKQGSSV 
DAAVAAALCLGIVAPHSSGLGGGGVMLVHDIRRNESHLIDFRES 
APGALREETLQRSWETKPGLLVGVPGMVKGLHEAHQLYGRLPWS 
QVIAFAAAVAQDGFNVTHDLARAjUAEQLPPNMSERFRBTPLPSG 

HAGGVITEEDPSNYSAIiVEKPVCGVYRGHLVLSPPPPHTGPALI 
SALNILBGFrn^TSLVSREQAiaWVAETLKIAIiALASRLGDPVYD 
w A i ioiai ujui JuortviSAAX "KtsrlXNDSQAAPAPLLrPVYEIiDGAPT 
AAQVLIMGPDDFIVAMVSSLNQPFGSGLITPSGILIiNSQMLDFS 
WPWRTANHSAPSLENSVQPGKRPLSFLLPTWRPAEGLCGTYIA 

LGANGAARGLSGLTQVRFTPWLAFFSREPSCGLDCRCLSYLWLV 
SIPHAANMG 


6621 
21^22 


1 


662 


VQGirSYQQRl.QAiaiKEKSRDAARSRRGKENPEFYEIAKLLPLP 
AAI TSQiajKAS 1 IRLTISYLKMRDFANQGDPPWNLRMEGPPPNT 
SVKVIGAQRRRSPSALAIEVFEAHLGSHIIiQSLDGYVFAIiNQEG 
KFLYISETVSlYLGI^QVELTGSSVFDYVHPGDHVEMAEQIiGMK 
LPPGRGLLSQGTAEDGASSASSSSQSETPEPWCFPPASDQFLL 


6623 


2 


319 


^.itAiitrAyjstJTliAGGPERARAMEANMPKRKEPGRSLRIKVISMGN 
AEVGKSCIIKRYCEKRFVSKYIATIGIDYGVTKVHVRDREIKVN 
IPDMAGHPPFYEVRKPF 




1886 


189 

I 
] 
1 


KAIiFEKVKKFRLHVEEGDiLYAMYVRQTVLKVIKFLI 1 1 AYNSA " 

LVSKVQFTVDCNVDIQDMTGYKNFSCNHTMAHLFSKLSFCYLCF 

VSIYGLTCLYTLYWLPYRSLRBYSFEYVRQETGFDDIPDVKNDF 

AFMLHMrDQYDPLySKRFAVFLSEVSENKLKQLNLNNEWTPDKL 

RQKLQTNAHNRLELPLlMtjSGLPDTVFEITELQSIiKriEIIKMVM 

IPATIAQLDNLQELSLHQCSVKIHSAALSFLKENLKVLSYKFDD 

^ELPPWMYGLRNLEE1,YLVGSLSHDISRNVTLESLR0I>KSI.KI 

[.S I KS^^^S KI PGAVVDV^SHLQKMCIHNDGTKLVMrjmLiCKiy™ 

jTELELVHCDLERIPHAVFSLLSLQEtiDLKENNLKSIEEIVSFQ 



531 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresjponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Aramo acia segment containing signal peptlH^ 

(A=Alanine, C=Cfvahein*» n^sc^^^i-^ „ * ■ j ^ 
1 ^-^-ywteine, U"Aspartic Acid, E=> 

Glutamic Acid, F^Phenylalanine, G=Glycine, 

H=Histidine, I=Isoleucine, K=Lysine, 

L=Leucine, M=Methionine, NaAsparagine 

P=Proline, Q=Glutamine, R=Arginine, 

S=Serine, T=Threonine, V=Valine, 

W=Tryptophan, Y^Tyrosine, X=Unknovm, *^Stop 

Codon, /^possible nucleotide deletion, 

\=possible nucleotide insertion) 


6624 






iil.KK£.TVLKLWHysiTYl-i>EHlKKLTSLERLaFSHMKIEVLPSH 

LFLCNKIRYLDLSYNDIRFIPPEIGVLQSLQYPSITCNKVESLP 

DELyFCKKLKTLKIGKNSI.SVLSPKIGNLLPLSYLDGKC5NHFEI 
LPPELGDCRALKRAGLWKDTILPPTT Dcm7DT?rMwvfr>« 


6625 


218 


1786 


GSRRGGGSRXPAVSTHVAPGRSVLRPFASGALRLRSLVKALGGC 

RGRPSGLAHLSQETSHWRAKRSGRACLGDFPGEILRSFIMKCTA 

REWLRVTTVLFMARAIPAMWPNATLLEKLLEKYMDEDGEWWIA 

KQRGKRAITDNDMQSILDLHNKLRSQVYPTASNMEYMTWDVELE 
RSAESWAESCLSJEHGPAST.TiP^TfsnKFT r^atJT.Tr^■n\7r»l^T^^n™...— — , 
v-jLi^atioi-iWiiiji^ij Xt^NXjGAHWGRYRPPTFHVQSW 

YDEVKDPSYPYEHSCBrPYCPFRCSGPVCTHYTQWWATSNRIGC 
AINLCHNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCS 
ACPPSFGGGCRENLCYKEGSDRYyppREEETNBIERQQSQVHDT 
HVRrRSDDSSRNEVISAQQMSQiVSCEVRLRDQCKGTTCNRYEC 
PAGCLDSKAKVIGSVHYEMQSSICRAAIHYGIIDNDGGWVDITR 
QGRKHYPIKSNRNGIQTIGKYQSANSFTVSKVTVQAVTCETTVB 
_QLCPFHKPASHCPRVYCPRKLYASKSTLCSCrrawSSr>P 


6626 


1124 


S43 


^Ub'KiiL^USLI.STlUVUjRb'KGLGMHPGPSSGGTKGGVPTALRPP ' 

GPLVPSTSDDWLLKNIELFDKI^ALRFHGRLLFLKDVLGDEICCW 

SFYGOGRKlAEVCCTSIVYATEXKQTKVEFPEARIPEErriNILJ 

YETPRGPDPALLBATGGAAGAGGAGRGEDEEWREHRVRRrHVRR 
HITHDERPHGQQIVFKD 


■ 6627 ■ 


3 


1498 


iiAVKFVYTDRrHbliiGiSVEFLCSLRSDATMKSlTACLHALnAL 
LDVPWPRSKIGSDQDSGIELLNVLHRVILTRESPSIQLASLEW 
RQIICAAQEHVKEKRRSAEVDDGAAEKETI,PEFGEGKDTGGLVP 
GKSLVPATLELCVCILVRQLPELNPiaTGSPGVKATKPQILLED 
GSRLVSAALVILSELPAVCSPEGSISILPTILYLTIGVLRETAV 
JM.i'OtrQi^SSTVAASLQALKGILSS PMARAEKSRTAWTDLIiRSAL 
TTILDCWDPVDETHQBLDEVSLLTAITVFILST5PEVTTIPCLQ 
KRCIDKFKATLEIKDPWQIKTYQLLHSIFQYPNPAVSYPYIYS 
LASCIMEKLQEIDKRKPENTAELEIFQEGIKVLETLVTVAEEHH 
RAQLVACLLPILISFLLDENSLGSATSIMRNIiHDFALQNLMOIG 
PQYSSVFKSLVASSPALKARLEAAIKGNQESVKVKIPTSKYTKS 


6628 


1 


697 


mPHI.SSRDMTGTPGAVArRDGBAPERSPPC5P5YDLTGKVMLL- 

GDTGVGKTCFIilQFKDGAFLSGrFIATVGIDFRNKWTVDGVRV 

KLQIWDTAGQERFRSVTHAYYRDAQALIiLYDITNKSSFDNIRA 

WLTEIHEYAQRDWIMLLGNKADMSSERVIRSEDGETLAREYGV 

PFLETSAKTGMNVELAFIAIAKBLKYRAGHQADBPSEX3IRDYVE 
SQKKRSSCCSFM 


6629 


1 


1861 

i 

I 

i 
c 


UUAi^yGGGSGQGGGSGGGGSGGGRGAGGEENKEMERPSAGSKAN 

KEPGDSLSLBILQIIKESQQQHGLRHGDF^YRGYCSRRQRRLR 
KTLNPKMGNRHKFTGKKVTEEriLTDKTPVT t r ^rr MnAwo^rr/,,,* 

iw*\*\v AooiJij j.jjnKijjjjijVJLiMDAERAWSYAM 

QLKQEANTEPRKRFHLLSRLRKAVKHAEELERLCBSNRVDAKTK 

I^EAQAYTAYLSGMLRFEHQEWKAAIEAFNKCKTIYEKIASAFTE 

EQAVLYNQRVEEISPNIRYCAYNIGDQSAINEIiMQMRLRSGGTB 

GLLAEKLEALITQTRAKQAATMSEVEWRGRTVPVKIJDKVRIFLL 

3LADNEAAIVQAESEETKERLPESMLSECRDAIQWREELKPDQ 

KQRDYILEGEPGKVSNLQYLHSYLTYIKLSTAIKRKENMAKGLQ 

WLLQQQPEDDSKRSPRPQDLIRLYDIILQNLVELLQLPGLEED 

<AFQKEIGLKTLVFKAYRCPFIAQSyVLVKKWSEALVLYDRVIiK 

fANBVNSDAGAFKNSLKDLPDVQELITQVRSEKCSLQAAAILDA 

JDAHQTETSSSQVKDNKPLVERFETFCLDPSLVTKQAWLVHPPP 

JFQPIPCKPLPFDLALNHVAPPPLEDKLBQKTKSGL'TOYIKGIP 
JFRS 




5653 


4549 G 
H 


ATPLGSVGGRTGKMDAATLTYDTLRPAEPEDFPETSEPVWILG 
KYSIFTEKDEILSDVASRLWFTYRKNFPAIGGTGPTSDTGWGC 



532 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


yjredicted 

beginning 

nucleotide 

location 

vo vx. i copontxing 

to first 
amino acid 
residue of 
amino acid 
sequence 


1 Predicted end" 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
(A=Alanine, C«Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
HaHistidine, I=Isoleucine, K^Lysin©/ 
L^Jbeucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R^Arginine, 
S=Serine, T=^Threonine, V=:Valine, 
W=7ryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








MLRCGQMIfAQALVCRHLGRDWRWTQRKRQPDSYFSVIiNAFIDR 
KDSYYSIHQIAQMGVGEGKSIGQWYGPNTVAQVLKKLAVFDTWS 
SLAVHlAMDNTAWaEEIRRI,CRTSV?CAGATAPPADSDI2HCWGF 
PAGAEVTNRPSPWRPLVLLrPLRLGLTDINEAYVETLKHCFMMP 
QSLGVIGGKPNSAHYPIGYVGEELIYU5PHTTQPAVEPTDGCFI 

PDESFHCQKPPCaMSlAELDPfllAWRGGHLSTQAFGAECCLGM 
TRKTFGPLRFFPSMLG 


6630 


2 


423 


LVQCGGIRRRSAWGAMPGRHVSRVRALYKRVLQLHRVLPPDLKS 
LGDQYVKDEFRRHKXVGSDBAQRFIiQKWEVYATALLQQANENRQ 

NSTGKACFGTFLPEEKLNDPRDEQIGQLQEIiMQEATKPJSIRQFSI 
SBSMKPKF 


6631 
" 6632 


2 


423 


LVQCGGIRRRSAWGAMPGRHVSRVRALYKRVLQLHRVLPPBLKS" 
LGDQYVKDEFRRHKTVGSDEAQRPIiQEWEVYATALLQQANENRQ 

NS TGKACFGTFLPEEKIiNDFRDEQ I GQLQEIiMQEATKPNRQFS I 
SESNKPKF 




1273 


568 


WNSRGRTC3RGAAPLAPAAAMKAWQRVTRftSVTVGGEQISAIGR 
GICVI*LGISLEDTQKELEHMVRKII*NLRVFEDESGKHWSKSVMD 
KQYEILCVSQPTl.QCVLKGNKPDFHtAMPTEQAEGFYNSFLEQL 
RKTYRPELIKDGKFGAYMQVHIQNDGPVTIELESPAPGTATSDP 

KQLSKLEKQQQRKEKTRAKGPSESSKBRNTPRKEDRSASSGAEG 
DVSSEREP 


6633 


1145 


617 


ATGRHEGVPTLEGX IQQLVNGIITPATIPSLGPWGVUHSNPMDY ' 
AWGANGLDAIITQLLNQFENTGPPPADKEKIQALPTVPVTEEHV 
GSGLECPVCKDDYALGERVRQLPCNHLFEIDGCIVPMLBQHDSCP 
VCRKSLTGQNTATMPPGLTGVSFSSSSSSSSSSSPSNENATSNS 


6634 


1 


1134 


CGGI PRKGSGPRRRLPMARLRDCLPRLMLTLRSLLFWSLVYCYC " 

GLCASIHLLKLLWSLGKCPAQTFRRPAREHPPACLSDPSLGTHC 

YVRIKDSGLRFHYVAAGERGKPLMLLLHGFPEFWYSWRYQLRKF 

KSBYRWAtiDLRGYGETDAPIHRQNYKLDCLITDIKDIliDSLGy 

SKCVLIGHDWGGMIAWLIAXCYPEMVMKLIVINPPHPNVFTEYI 

T.RHPAQLLKSSYYYFFQIPWFPEFMPSINDFKVLKHLFTSHSTG 

IGRKGCQLTTEDLEAYIYVFSQPGALSGPINHYRNIFSCLPLKH 

HMVTTPTLLLWGEWDAFMEVEMAEVTRFYVKNYPRLTILSBASH 

WLQQDQPDI VNKL I WTFLKEETRKKD 


6635 - 


1420 


470 


BMRAGQQLASMr.RWTRAWRLPREGLGPHGPSFARVPVAPSSSSG"" 

GRGGAEPRPLPLSYRI^LDGEAALPAWFIjHGLFGSKTNFNSIAK 

ILAQQTGRRVLTVDARNHGDSPHSPDMSYEIMSQDLQDLLPQLG 

lvpcvwghsmggktamllalqrpelverliavdispvestgvs 

HFATyVAAMRAIWIADELPRSRARKlJU>SOI.SSVIQDMAVRQHl, 

ltnlvevdgrfvwrvnuyu^tqhlidkilafpqrqesylgptlfii 
lggnsqfvhpshhpeimrlfpraqmqtvpnaghwihadrpqdpi 
aairgflv 


6636 


1514 


1801 


sfcmfshkqdshfqavpvqekkbcrlrrapwrafaqpqrlkhpae 

V r i vrt V v-iJUKr ijv-*^ V j>j jt* V uy y XjPP s liGP viiS PHS DPGWCRVD 
DGGDGVP 


6637 


2 


1501 


cssspcfhdgtcvldkagsykcaclagytgqrcenlLeagkski 
kasedslsvleerncsdpggpvngyqkitggpgliwgriiaiagt 
wsffcnnsyvlsgmskrtcqqngewsgkqpicikacrepkisd 

LVRRRVIjPMQVQSRETPLHQLYSAT^SKQKIiQSAPTKKPALPFG 
DLPMGYQHLHTQLQYKCISPFYRRIjGSSRRTCLjRTGKWSGRAPS 
CIPIOGKl ENITAPKTQGLRWPWQAAI YRRTSGVHDGSLHKGAW 
FLVCSGALVNERTNAn^AAHCVTOLGKVTMXKTADLKVVLGKFYR 
DDDRDEKriQSLQISAlILHPNYDPILLDADIAILKLLDKARIS 
TRVQPIGIiAASRDLSTSFQESHIXVAGWNVLADVRSPGPKNDTL 
RSGWS WDSLLCEEQHEDH6I PVS VTDNMFCASWEPTAPSDI C 
rAETGGIAAVS FPGRASPEPRWHLMGLVSWS YDKTCSHRLSTAF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue o£ 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
cor re s p ondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A^Alanine, C-Cysteine, D=Aspartic Acid, 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
HoHistidine, I«Isoleucine, K«Lysine, 
L=Leucine, M=Methionine, lI=Asparagine, 
P==Proline, Q=Glutamine, R=Arginine, 
Ss^Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosiae, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=pcssible nucleotide insertion) 








TKVLPFKDWIERNMK 


6638 


1391 


224 


GGIPQAGGKMAAPWWRAALCECRRWRGFSTSAVLGRRTPPIiGPM 
PNSD I DLSNLERLEKYRSFDRYRRRAEQEAQAPHWWRTYREYFG 
EKTDPKEKIDIGIjPPPKVSRTQQLLERKQAIQELRANVEEERAA 
RLRTASVPLDAVRAEWERTCGPYHKQRIjAEyyGLyRDLFHGATF 
VPRVPLHVAYAVGEDDLMPVYCGNEVTPTEAAQAPEVTyEAEEG 
SLWTIiLLTSlDGHIiLEPDAEYIiHWLli'miPGNRVAEGQVTCPYL 

ppfpargsgihrlafllfkqdqpidfsedarpspcyqlaortfr 
tfdpykkhqetmtpaglsffqcrwddsvtyifhqlldmrepvfe 
fvrpp pyhpkqkrf phrqplryldryrdsheptygi y 


S639 


2046 


1268 


igcfimdggddgnliikkrfvseaelderrkrrqeewekvrkpe 
dpeecpeevydprslyerlqeqkdrkqqbyeeqfkfknmvrgld 

EDETNPLDEVSRQQELIEKQRREEEIjKELKEYRNNLKKVGISQE 

nkkevekkltvkpietknkfsqakliiagavkhkssesgnsvkrl 
kpdpepddknqepssckslgntslsgpsihcpsaavcigilpgl 
gaysgssdsesssdsegtiwatgkivssifrtntpleap 


6640 


117 


1043 


VLEPPDVSMAESEDRStRlVT*VGKT<S6<:jkSATANTlLGEEiFDS 
RIAAQAVTKNCQKASREWQGRDLLWDTPGLFDTKESIiDTTCKE 
ISRCIISSCPGPPIAIVIiVLLLGRYTBEBQKTVAIilKAVFGKSAM 
KHMVILFTRKEELEQQSFHDFIADADVGLKSIVKECGNRCCAPS 
NSKKTSKAEKESQVQELVELIEKMVQCNEGAYFSDDIYKDTEBR 
LKQREEVLRKIYTDQLWEEIiCLVEEDKHKSEEKKEKEIKDLKLK 
YDEKI KNIREEAERNIFKD VFNRI WKMLSEI WHRFLS KCKPYSS 


6641 


1 


834 


SAAVGRRSEVRGCAPRPRLRRSARRMDPVPGTDSAPIAGIAWSS 
ASAPPPRGFSAISCTVEGAPASFGKS FAQKS GYFLCLSSLGSLE 
NPQENWTUDIQI WDKS PLPLGFS PVCDPMDSKAS VSKKKRMCV 
KIiLPI^ATDTAVFDVRLSGKTKTVPGYLRIGDMGGFAIWCKKAK 
APRPVPKPRGLSRDMQGLSLDAASQPSKGGIjLERTASRIGSRAS 
TLRRNDSIYEASSLYGISAMDGVPFTIiHPRFEGKSCSPIiAFSAF 
GDLT I KSIiAD IEEE YNYGFWEKTAAARLPPS VS 




22 


129^ 


PLEERMMTKMDPNDQAQRD 1 1 FSLRRIAFDAESDPSNAPGSGTE 

KRKAMYTKDYKMIlGFTNHINPAMDFTQTPPGMrJ^*D^^ 

HQDTYIRIVLENSSREDKHECPFGRSAIELTKMLCEILQVGELP 

NEGRNDYHPMFFTHDRAFEELPGICIQLLNKTWKEMRATAEDPN 

KVMQWREQITRALPSKPNSLDQFKSKLRSLSYSEILRLRQSER 

MSQDDFQSPPIVELREKIQPEILELIKQQRLNRLCEGSSFRKIG 

NRRRQERFWYCRLAIxNHKVIiHYGDLDDNPC3GBVTFESLQBKrPV 

ADI KAIVTGKDCPHMKEKSALKQNKE VIiEIiAFS ILYDPDETUIF 

lAPNKYEYCIWIIXSLSALLGiOJMSSELTKSDIjyrLLSMBMICLRL 

LDLENIQI PEAPPPIPKEPSSYDFVYHYG 


6643 


3049 


2265 


SLHAPAEGRTRGRLAEKPKMLTRKIKLWDINAHITCRLCSGYIil 
DATTVTECLHTFC|?SCLVKyijEENNTCPTCRIVIHQSHPLQYIG 
HDRTMQDIVYKbVPGLQEAEMRKQREPYHKLGMEVPGDIKGETC 

CLECNSSKLRGLKRKWIRCSAQATVLHLKKFIAKKIiNLSSFNEL 
DIX.CNEEILGKDHTLKFWVTRWRFKKAPLLLHYRPKMDLL 


6644 


1489 


290 


FRPLATEPRGSSPVQLVSSTMSVRTLPLLFLNLGGEMLYILDQR 
LRAQNIPGDKARKVLNDIISTMFNRKFMEEliFKPOELYSKKALR 
TVYERLAHASIMKIiNQASKDKLYrLMTMAFKYQVLIjCPRPKXlfVL 
LVTFNHLDTIKGFIRDSPTILQQVDETLRQLTEIYGGLSAGEFQ 
LIRQTLLIFFQDLHIRVSMFLKDKVQNKNGRFVLPVSGPVPWGT 
EVPGI* I RMFJNWKGEE VKRI EPKHGGNYVPAPKEGSPEFYGDRVL 
KLGTNMYSVNQPVETHVSGSSKNIASMTQESIAPNPLAKEELNP 
LARLMGGMEIKKPSGPBPGFRLNLFTTDEEEEOAALrRPEELSY 
EVINIQATQDQQRSEELAR IMGEFE ITEQPRLSTSKGDDLliAMM 
DEL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end" 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=5Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
HsHistidine, I^Isoleucine, K-Lysine, 
L^Leucine, M-Methionine, N-Asparagine , 
P=Proline, Q^Glutamine, R=Arginine, 
SsSerine, T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /r^possible nucleotide deletion, 
\=possible nucleotide insertion) 


S645 


6S30 


4646 


FVEGLAGYVYKAASEGKVLTLAALLLNRSESDIRYLLGYVSQQG 
GQRSTPLl lAARNGHAKWRLLLEHYRVQTQQTGTVRFDGYVlD 
GATALWCy^AGAGHFEVVKltLVSHGANVWHTTVTNSTPLRAACFD 
GRLDIVKYLVENNAWISIANKYDNTCLMIAAYKGHTDWRYLLE 
QRADPNAKAHCGATALHPAAEAGHIDIVKELIKWRAAIWNGHG 
MTPLKVAAESCKADWELLLSHADCDRRSRIEALELLGASFAND 
RENYDIIKTYHYLYLAMLERFQDGDNILEKEVLPPIHAYGNRTE 
CRNPQEIiES I RQDRDALHMEGIil VRER ILGADNIDVSHPI I YRG 
AVYADNMEPEOCIKLWliHAT.HT.PrtVftMPlsrruvnT t c»t?nrrtT"cte?/>*# 

IHliNETVKAPDIECVLRCSVLEIEQSMKRVKNISDADVHNAMDN 
YECNLYTFLYLVCISTKTQCSEEDQCKINKQIYNLIHLDPRTRE 
GFTLLHIiAVNSNTPVDDPHTNDVCSFPNALVTKLriLDCGAEVNA 
VDNBGNSALHI IVQYNRPISDFLTLHSII ISLVEAGAHTDMTNK 
QNKTPIiDKSTTGVSEILLKTQMKMSIiKCIAARAVRANDINYQDQ 




176 


890 


PSSRMNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESF" 
EGREKKGISDVRRTFCLFVTFDIiLFVTLLWIIELNVWGGIENTL 
EKEVMQYDYYSSYFDIFLLiAVFRFKVLILAYAVCRLRHWWAIAL 
A i*4v xonr ijwitv.v Aij^iUjh owGAFGxVIjPIlSFIIjAWIETWFIiD 

PKVLPQET^EENRLLIVQDASERAALIPGGLSDGQFYSPPESBA 
GSEEAEEKQDSEKPLLEL 


~6S48 


176 


890 


PSSRMNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESY 

EGREKKQISDVRRTFdiFVTPnT.T.m/TT.T.MT T PT KTinar*/- T cvwr 

EKEVMQ YDY YSS YFDI FLLiAVFRFKVLI LAYAVCRLRHWWAI AL 
TTAVTSAFLLAKVTLSKLFSQGAFGYVLPIISFIIAWIETWFLD 
FKVIiPQEAEEENRLLIVQDASERAALIPGGLSDGQFYSPPESEA 
GSEBAEEKQDS EKPLLEL 




413 


897 


RKCWNCFTKYFNSPPEDIDHKDSYLITRSIMABPDYIEDDNPEiT" 
IRPQKLlNPVKTSRNHQDIiHRELLMNQKRGIiAPQNKPELQKVME 
KRKRDQVIKQKEEEAQKKKSDLBIELLKRQQKLEQLELEKQKLQ 
EEQENAPEFVKVKGNLRRTGQEVAQAQES 


6649 


1357 


832 


WIPRAAGIRHEVKWDVKEIMSQHNIYVDAliliKEFEQFNRRLNEV 

skrvriplpvsnilwehcirlanrtivegyanvkkcsnegralm 
qldfqqflmklekltdirpipdkefvetyikayyltendmerwi 
kehreystkqltntlvnvclgshinkkarqkllaaiddidrpkr 


6650 


32 


765 


LVPIjVFSLLVQSCKQVYRSIAMKPVPCLLLVTLSCLGTLGQAPR 
QKQGSTGEEFHFQTGGRDSCTIVIRPSSIiGQGAGEVWLRVDCRNTD 

qtywceyrgqpsmcqafaadpksywnoalqelrriihhacqgapv 
lrpsvcreagpqahmqqvtsslkgspepnqqpeagtpslrpkat 
vklteatqlgkdsmeelgkakpttrptakptqpgprpggneeak 
kkawehcwkpfqalcaflispprg 


6651 


3425 


1353 


AKELLKVGDFSLCAGPYQNTADTMENLSKEPIiASFVSESFBISA 

cgiatehvkidnsgegltaeagsetlsrdgevgvnsdmhyelsg 
dsdldliigdcrnprldledsytlrgsytrkkdvptdgyesslnf 
hnnnqedwgcsswvpgmetslppghwtaavkkeekcvppyvqir 
dlhgilrtyanfsitkelkdtmrtshglrrhpsfsancglpssw 

TSTWQVADDLTQNTLDLEYLRFAHKLKQTIKKGDSQHSASSANV 

fpkesptqxsigafpstkiseapflhpaprsrspllvtwesdp 
rpqgqprrgytassldsssswrercshnrdlrnsqrnhtvsfhil 
nklkynstvkesrndislliinbyabfnkvmknsnqfipqdkeiin 

DVSGEATAQEMYIiPFPGRSAS YEDI I IDVCTNLHVTOiRS WKBA 

ckstflfylveteoksffvrtknldrkgghteiepqhfcqafhr 
endtliiiirnedisshlhqipsllklkhppsvipagvdspgdv 
ldhtyqelfraggfvrsddki i.eavtlvqlkei iki leklngng 
rwkwllhyrekkklkedervdstahkknimlksfqsaniiellh 

YHQCDSRSSTKAEIIiKCLUJLQIQHinARFAVLLTDKPTIPREV 
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SEQ 
ID 
NO: 


Preciicted 

beginning 

nucleotide 

location 

c or respondi ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid. E= 
Glutamic Acid. F=Phenylalanine, G=Glycine, 
H=Histldine, I^Isoleucine, K=Lysine, 
Ii-Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, VssValine, 
W«Tryptophan, Y=:Tyro3ine, X=Un}cnown, *==Stop 
Codon, /^possible nucleotide deletion, 
X^possible nucleotide insertion) 








FENNGI LVTDVNNFI ENIEKIAAPFRSS YW 


66S2 


2 


1343 


IPGSTISCSCHSRRLRGGSFAPRLSLGAASPRPRPPSLPLPLPL 
PPPLFLPTRPAERAWIRSRRASEWVGKMEVPRLDHALWSPTSPC 
EEVIKNLSLEAIQLCDRDGNKSQDSaiAEMEELPVPHNIKISNI 
TCDSFKISWEMDSKSKDRITHYFIDIjNKKENKNSNKFKHKDVPT 
KLVAKAVPLPMTVRGHWFLSPRTEYTVAVQTASKQVDGDYWSE 
WSE I IB FCTADYSKVHLTQLLEKAE VIAGRMLKFS VFYRNQHKE 
YFDYVRHHHGNAMQPSVKDNSGSHGSPI SGKLEGIFFS CSTEFN 
TGKPPQDSPYGRYRFEIAAEKLFNPNTNLYFGDFYCMYTAYHYV 
I LVr APVGS PGDEFCKQRLPQLNSKDNKPLTCTEEDGVLVYHHA 
QDVr LEVI YTDPVDLSLGTVAEITGHQLMS LSTANAKKDPSCKT 
CNISVGR 


6653 


170 


1910 


FFLEPRLRPFPASRARFVPARTRPSPIiHPCCFCFEGGGSMLSPQ 

LLQDHIRDLDWWS VAGAFRKGKS FI LDFMLRYLYSQKESGHS 
NWLGDPEEPLTGFSWRGGSDPETTGIQIWSEVFTVEKPGGiCKVA 
WLMDTQGAFDSQSTVKDCATIFALSIMTSSVQIYNLSQNIQED 
DLQQLQLPTE YGRLAMDE I FQKP FQTLMPLVRDWSFPYE YS YGL 
QGGMAFLDKRLQVKEHQHEEIQNVRNHIHSCFSDVTCFLLPHPG 

SKVTCRQLLBYFKAYIKIYQGEDLPHPKSMLQATAEAYWriAAAA 
SAKDIYYNNMEEVCGGEKPYXiSPDILEEKHCEFKQLALDHFKKT 

IClCMC3f3KT)17fil**D VrV1t?T.T?T?'Ti'TVCT VCXTDr^trtJIVT/^O WttiTsnmrmmT^'* 

VLFTGIVALYlASGLTGPIGtEWAQLFNCMVGLLLIALLTWGY 
IRYSGQYRELGGAIDFGAAyVLEQASSHIGNSTQATVRDAWGR 
PSMDKKAQ 


6654 


1 


70S 


RTSLSPSQCSSFNUAMASAGMQlLGWLTLIiGWVNGLVSCALPM 
WKVTAFIGNS I WAQWWEGLWMSCWQSTGQMQCKVYDSLIiAL 
PQDLQAARAIiCVrALLVALFGLLVYIiAGAKCTTCVEEKDSKARI) 
VLTSGIVFVISGVLTLIPVCWTAHAVIRDFYNPLVAEAQKRELG 
AS LYLGWAASGLLIjLGGGLLCCTCPSGGSQGPSHYMARYS TSAP 
AISRGPSEYPTKNYV 


6655 


341 


16 


me js.j\by luoAUAIj V If a IjP V r AA bH W i DVRVPEQ YQQEH VQGA 
INI PLKEVKERl ATAVPDKNDTVKVYCNAGRQSGQAKEILSEMG 
YTHVENAGGIiKDIAMPKVKG 


€656 


2 


1212 


XELPPRPANLAI QP PLSPLRALAPLPEKPGAVPP PQKRMAKVAK 
DLNPGVKFCMSLGQLQSARGVACLGCKGTCSGFEPHSWRKICKSC 
KCSQEDHCLTSDLEDDRKIGRIiMDS KYSTLTARVKGGDGI RI Y 
KRNRMIMTNPXATGKDPTFDTITYEWAPPGVTQKLGLQYMErirP 
KEKQPVTGTEGAFYRRRQLMHQLPIYDQDPSRCRGIiLENELKIiM 
EEPVKQYKSEALGVGEVALPGQGGLPKEEGKQQEKPEGAETTAA 
TTNGSLSDPSKEVEYVCBLCKGAAPPDSPWYSDRAGYNKQWHP 
TCFVCAKCSEPLVDLiI YFWKDGAPWCGRHYCES LRPRCSGCDE I 
IFAEDYQRVEDLAWHRKHFVCEGCEQXiliSGRAYIVTKGQLLCPT 
CSKSKRS 


6657 


830 


2120 


IiLTCQERAGDCLLSASTMkEWYWSPKKVADWIiLENAMPEYCEP 

LEHFTGQDIiINLTQEDPKKPPIiCRVSSDNGQRLLDMIETLKMEH 

HLEAHKNGHANGHLNIGVDIPTPDGSFSIKIKPNGMPNGYRKEM 

rKIPMPELERSQYPMEWGKTFLAFLYALSCFVLTTVMISWHER 

VPPKEVQ^PLPDTFFDHPNRVQWAFSICEIKTGMILVGLWLIQWL 

LLKYKSIISRRFFCIVGTLYLYRCITMYVTTLPVPGMHFNCSPK 

LFGDWEAQLRRlMKLIAGGGLSITOSHNMCGDYLYSGHTVMLTIi • 

TYLFIKEYSPRRLWWYHWICWIiIiSWGIFCILLAHDHYTVDVW 

AYYITTRLPWWYHTMANQQVLKBASQMNLIiARVWWYRPFQYFEK 

NVQGIVPRSYHWPPPWPWHLSRQVKYSRLVNDT 


6658 


35 


855 


HCCALGAPGSPYRGLYFSSAAPCTAPRKAKHQSTLEGLTKRMLM 
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SEQ 
ID 
NO: 


Predicted 
beginnincf 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C«»Cysteine, D«Aspartic Acid^ E= 
Glutamic Acid, F^Phenylalanine, G=Glycine, 
HsHistidine, I=Isoleucine, KaLysine, 
L=Leucine, M=Methionine, N-Asparagine , 
P^Proline, Q=Glutamine, R^Arginine, 
S^Serine, T=Threonine, V= Valine, 
WzsTryptophan, YsTyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\»possible nucleotide insertion) 








FDPVPVKQEAMDPVSVSYPSNYMESMKPNlCyGVI YSTPLPEKFF " " 

QTPEGIjSHGIQMBPVDLTVNKRSSPPSAGNSPSSLKFPSSHRRA 

SPGLSMPSSSPPIKKYSPPSPGVQPFGVPLSMPPVMAAALSRHG 

IRSPGILPVIQPVWQPVPFMYTSHLQQPLMVSLSEEMENSSSS 

MQVPVIESYEKPISQKKIKIEPGIEPQRTDYYPEEMSPPLMNSV 

SPPQALLQE 


6659 


18 


523 


EPQRGDCETWFQNCSLPKFVCFPCWGFWIiWRAHSMSNIiHSLPGL 
RGIjTS I SRNOLOCTNAMRV T NNTYORI? WKMOMT PT .T.a T Pn-NnriTwi;^ 

CGNPTITCPHNRTl»NNCHHSGVQVPLMYCWLTTPSPQNISNCRY 
AQT PANM FY I VACDNRDnPR DP Pn Y P \r\J P VWT .UT T T 


6660 


514 


1707 


CAASLDCRHHLCEPDMKLVWPSAKLLQAAAGASAEACDSVTSNV 

NG FKDQLCSLVPMALTDPSTQLQLVG IRTLTVIiGAQP DLLS YBD 
LELAVGHLYRLSFLKEDSQSCRVAALEASGTLAALYPVAFSSHL 
VPKLAEELRVGESNLTNGDEPTQCSRHLCCLQALSAVSTHPSIV 
KSTLPLLLQHLWQVNRGNMVAQSSDVIAVCQSLRQMAEKCQQDP 
ESCWYFHQTAIPCLLALAVQASMPEKEPSVLRKVLLEDEVLAAM 
VSVIGTATTHLSPBLAAQSVTHIVPLFLDGNVSFLPENSFPSRF 
QPFQDGSSGQRRLIALLMAFVCSLPRNVSEHIWEVLLFNLDKVT 
PG 


6661 


179 


430 


GVHAASGTLSATWLAEAKMFDSLAKAGKYLGQAAKLM IGMPDYD 
NYVEHMRVNHPDQTPMTYEEFFRERQDARYGGKGGARCC 


6662 


185 


423 


RSLP KPAPAQPAS I HCARFSGVTPPTAKTAMSDGliTAPNALMYC 
GPKADDGNI PSACAPASSAVKASVSVAQPGQAVIP 


6663 


3 


1005 


RPVLSSR VDDFVPPLPBTSGRRKKLERMYS VDRVSDDI PI RTWF 
PKBNLFSPQTASl^TMQAISNFRKKLRMVGSRRVKftQTFAERRER 
S PSRSWSDPTPMKADTSHDSRDSS DLQSSHCTLDEAFEDLDWDT 
EKGLEAVACDTEGFVPPKVMLISS KVPKAEYI PTI IRRDDPSX I 
P ILYDHEHATFED ILEE I ERKLNVYHKGAKI WKMLI FCQGGPGH 
LYLLKNKVATFAKVEKEEDMIHFWKRLSRLMSKVNPEPNVIHIM 
GCYILGNPNGEKLFQNLRTLMTPYRVTPBSPLELSAQGKQMIET 
YFDFRLYRLWKSRQHS KLLDFDDVL 


6664 


S8 


968 


PRLLRLPRS WVMDS PWDELALAFS RTSMPP FFDI AHYLVSVMA 
VKRQPGAAAIAWKNPISSWPTAMLHCPGGGILSCLriLAEPPLKF 
LAKHTMILLAS S I WYITFFCPHDLVSQGYSYLPVQ1.LASGMKEV 
TRTWKIVGGVTHTU^fSYYKNGWIVMIAIGWARGAGGTIITNFERL 
VKGDWKPEGDEWLKMSYPAKVTLLGSVIFTFQHTQHLAISKHNL 
MPLYTIFIVATKITMMTTQTSTMTFAPFEDTLSWMLFGWQQPPS 
SCEKKSEAKSPSNGVGSLASKPVDVASDNVKKKHTKKNE 


6665 


171 


1278 


DERRLACRQWTQQRSELYPGFQKRQRPLPKAGBEAAAQGGRHL 
PGRWLGPQCTQNPCSVHTATGPEPRKLPLLPPDSPNSGYPKEPA 
ALCPGIPSPCRMTHQDLSITAKLINGGVAGLVGVTCVFPIDLAK 
TRLQNQHGKAMYKGMIDCLMKTARAEGPFGMYRGAAVNLTLVTP 
EKAIKLAANDFFRHLLMEDGMQHNLKMEMLAGCGAGMCQVWTC 
PMEMLKIQLQDAGRLAVHHQGSASAPSTSRSYTTGSASTHRRPS 
ATLIAWELLRTQGIiAGL YRGLGATLLRDIPFS 1 1 YPPLPANLNN 
LGFNELAGKAS FAHS FVSGCVAGS lAAVAVTPLDVLKTRI QTLK 
KGLGEDMYSGITDCAR 


6666 


498 


2868 


MTTFLPVPQMMAGFSFGTFGNPPMESPSAWQTIHQPFIVSCLTL 
WSPGCWPQPIQKEG VGLWD XRKPQSS LLRYGGNLSLQSAMS VRP 
KSNGTQLLALRRRLPPVLYDIHSRLPVFQPDNQVYPNSCTMKSC 
CFAGDRDQYILSGSDDFNLYMWRIPADPEAGGIGRWNGAFMVL 
KGHRS I VNQVRPNPHTYMICSSGVBKI IKIWSPYKQPGCTGDLD 
GRIEDDSRCLYTHEEYISLVLNSGSGLSHDYANQSVQEDPRMMA 
FFDSLVRREISGWSSDSDSDLSESTILQLHAGVSERSGYTDSES 
SASLPRSPPPTVDESADKAFHLGPLRVTTTNTVASTPPTPTCED 



537 



wo 01/53312 



PCT/USOO/34263 





SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptiHi" 
(A=:Alanine, C=Cysteine, D-Aspartic Acid, E= 
Glutatnxc Acid, F=.PhenyIalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, NsAsparagine, 
P=Proline, Q^=Glut amine, R=Arginine, 
S=*Serine, T=Threonine, v=:Valine, 
W=Tryptophan, y^Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


6G67 






AASRQQRLSAIiRRYQDKRLLALSNBSDSEENVCEVELDTDLFPR 
PRSPSPEDESSSSSSSSSSEDEEEI.NERRASTWQRNAMRRRQKT 
TRBDKPSAPIKPTNTYIGEDNYDYPQIKVDDLSSSPTSSPERST 
STLErQPSRASPTSDIESVERKIYKAYKWLRYSyiSySNWKDGE 
TSLVTGEADJEGRAGTSHKDNPAPSSSKEACLNIAMAQRNQDLPP 
EGCSKDTFKEETPRTPSNGPGHEHSSHAWAEVPEGTSQDTGNSG 
SVEHPFETKKLNGKALSSRAEEPPSPPVPKASGSTLNSGSGNCP 
RTQSDDSEERSLETICANHNNGRLHPRPPHPHNNGQNLGELEW 
AYSSPGHSDTDRDNSSLTGTLLHKDCCX3SEMACSTPNAGTREDP 
TDTPATDSSRAVHGHSGLKRQR3EL3DT0SENSSSEKKLKT 


6668 


171 

71 A 


1310 


ABEVERLAAMRSDSLVPGTHTttlRRRSKFANLGRIPKPWKWRK 
KKSEKFKHTSAALERKISMRQSREELIKRGVLKEIYDKDGSLSI 
SNEEDSLEKGQSLSSSQLSLPALSEMEPVPMPRDPCSYEVLQPS 
DIMDGPDPGAPVKLPCLPVKLSPPLPPKKVMICMPVGGPDLSLV 
SYTAQKSGQQGVAQHHHTVLPSQIQHQLQYGSHGQHLPSTTGSL 
PMHPSGCRMIDELNKTLAMTMQRLESSEQRVPCSTSYHSSGLHS 
GDQVTKAGPMGLPEIRQVPTWIECDDNKEKVPHESDYEDSSCL 
YTREEEEEEBDEDDDSSLYTSSLAMKVCRKDSLAIKPSNRPSKR 
ELEEKNILPRQTDEERLELRQQIGTKL 


6669 




358 


TmVATGPAI.TI.RCHVCTSSSNCKHSWCPASSRFCKTTMTVEP " 
LRGNLVKKDCAESCTPSYTLQGQVSSGTSSTQCCQEDLCNTEKIiH 
NAAPTRTALAHSAIiSLGLALSLLAVILAPSL j 


6670 


459 


1207 


KDEErRKDYDYMLDHPEEYYSHYYHYYSRRLAPKVDVRWiLVS 
VCAISVPQFFSWWNSYWKAISYLATVPKYRIQATErAKQQGLLK 
KAKEKGKMKKSKEBIRDEEENIIKNIIKSKIDIKGGYQKPQICD 
LliLFQIILAPFHLCSYIVWYCRWIYNFNlKGKEYGEEERLYIIR 
KSMKMSKSQFDSLEDHQKETFLKRELWIKENYEVYKQEQEEELK 
KKLANDPRWKRYRRWMKNEGPGRLTFVDD 


6671 


184 


594 


VARI*GEAAKMS3EPPPPypGGPTAPI,LEEKSGAPPTPGRSSPA 
VMQPPPGMPLPPADIQPPPYEPPGHPMPQPGFIPPHMSADGTYM • 
PPGFyppPGPHPPMGyYPPGPYTPGPYPGPGGHTATVI.VPSGAA 

xrvTV 


6672 


1 


763 


UPAEKPRSAPNMAGGRQGj>OJt,TAIiLAAWIAAVAATAGPEEAALP 
PEQSRVQPMTASKWTliVMEGEWMLKFYAPWCPSCQQTDSEWEAF 
AKNGEILQISVGKVDVZQEPGLSGRPFVTTLPAFFHAKDGIPRR 
yRGPGIPEDI<3NYILEKKWQSVEPLTGWKSPASLTMSGMAGI>FS 
XSGKIWHLHNYFTVTLGIPAWCSYVFFVIATLVPGLSMDLVL*V 
r SQCNWDPP YRHVS * /RP5XNLGVHTAHTSBHLRL 




6673 


304 


1089 


APGSKPVQFMDFEGKTSFGMSVFNLSNAIMGSGlL.61iAYAKAHT~ 

GVIFFLALLLCIALLSSYSIHLLLTCAGIA6IRAYEQLGQRAFG 

PAGKVWATVICLHNVGAMSSYLPIIKSELPLVIGTFLYMDPEG 

DWFLKGNLLIIIVSVLriLPLALMKHLGYLQYTSGriSLTCMLFF 

LVSVIYKKPQLGLCYRATMKQQWBSEALVQTPQPRDSTARVTrart 

MFHS*LTGVLTQWPIMAFAFVCHPGGAGPSITELCRAFQAQD 




6674 


1116 


1963 

] 


LQIQTHHTHHGARVTHLGSHQLLAKAGTMLCRQQSSSMAPAF^Q 
SVTCGPSPCVRKQESATKCLHIGACGSDLWARGWEQ6*G*GLNV 
WLCPCVAPHRGARPQAEEGQARWNSLVSSPWrPPNP*HSSlGAE 
NAVPRP*QG*iCVNPSGQERQS\WVLPLPVPGEPLKLPGLPG*NK 
SFSRV/SGSKGKWILPRQLM*AS*R\TPRFVPGTQWVPITW/PL 
ITWH*SAPTPPI,KACPAPRESDPCSSCLSCPCVIX3KPRFSDTGW 
PGAGHCHSSCDFTRKGAAGGPG 






1 


440 "] 
] 

, J 


L.attiYMCQYDYVEVRDGDNRJDGQIIKRVCGNERPAPIQSI<3SSL 
iVLFHSDGSKMFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSY 
<CACIiAGVTGQRCENLLEERNCSDPG/WPSQMVPEIWRGPWAYQ 
=TPC*IGTRV3^FLT 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

omi,no aCiQ 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
CA«Alanine, C=Cy3teine, D^Aspartic Acid, E== 
Glutamic Acid. F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P= Proline. Q=t31utamine , R=Arginine, 
SsrSerine, T=Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
_ \=po3sible nucleotide insertion) 


6675 


277 


1678 


GIWPTERMAFLDNPTI I IiAHIRQSHVTSDDTGMCEMVLI DHDVD 
LEKIHPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFGIR 
RRSWrAQRLERLRKERQNQXKCKNIQWKEJlNSKQSAQELKSLFE 
KKSLKEKPPISGKQSILSVRLEQCPIiQLNNPFlSrEYSKFDGKGHV 
GTTATKKIDVYLPLHSSQDRLLPMTWTMASARVQDLIGLrCWQ 
VTSBGREPKLNDNVSAYCLHIAEDDGEVDTDFPPI£>SNfEPXHKF 
GFSTIiALVEKYSSPGLTSKESLPVRINAMGFSLIQVDNTKVTM 
KEIIiLKAVKRRKGSQKVSGSRADGV?EEDSQIDIATVQDMLSSH 
HYKSFKVSMIHRLRFTTDVQL/GCAI.FPGVLRKRAAPVDCLRPS 
ADTWRQEQIGCCXSAACAAIiRS ♦DS HKC*EGISGDKVEIDPVTNQ 
KASTKFWIKQKPISIDSDLLCACNDLAEE 


667^ 


377 


1678 


GKWPTERMAFIiDNPTIILAHIRQSHVTSDDTGMCBMVLIDHDVD" 

LEKIKPPSMPGDSGSEIQGSNGETQGYVYAQSVDITSSWDFQIR 

RRSNTAQRtiERLRKERQNQIKCKNIQWKERNSKQSAQELKSLPE 

KKSLKEKP P ISGKQS I LS VRIiEQCPLQLNNPFNEYSKPDQKGHV 

GTTATKKIDVYLPLHSSQDRLLPMTWTMASARVQDI.IGLICWQ 

YTSEGREPKLNDNVSAYCLHIAEDDGEVDTDFPPLDSNEPIHieF 

GFSTLAIiVEKYSSPGLTSKESLFVRINAAHGFSLIQVDNTKVTM 

KEILLKAVKRRKGSQKVSGSRADGVFEEDSQIDIATVQDMLSSH 

HYKS FKVSM IHRLRFTTDVQL/GCALFPGVLRKRAAPVDCLRPS 

ADTWRQEQ IGCCGAACAAIiRS *DSHKC*EGI SGDKVEIDPVTNQ 

KASTKPWI KQKP IS IDSDLLCAC\DIjAEE 


6677 
~ 6678 


277 


1678 


GNWPTERMAFLDNPTI I LAHIRQSHVTSDDTGMCEMVLIDHDVD 
LE KIHPPSMPGDSGSE IQGSNGETQG YVYAQS VDirssWDFGIR 
RRSNTAQRLBRLRICERQWQIKCKNIQWKERNSKQSAQELKSLPE 
KKSLKEKPP I SGKQSILSVRLEQCPIiQZiNNPFNEYSKFDGKGHV 
GTTATKKIDVYLPLHSSQDRLLPMTWTMASARVQDLIGIiICWQ 
YTSEGRBPKLNDNVSAYCLHIAEDDGBVDTDFPPLDSNBPIHKF 
GFSTlxALVEKYSSPGLTSKESLFVRINAAHGFSLIQVDNTKVTM 
KEILIiKAVKRRKGSQKVSGSRADGVPBEDSQIDIATVQDMLSSH 
HYKSPKVSMIHRLRFTTDVQL/GCALFPGVLRKRAAPVDCLRPS 
ADTWRQEQIGCCX3AACAALRS*DSHKC*EGISGDKVEIDPVTNQ 
KASTKFWIKQKPIS IDSDLLCAC\DIiABB 




221 


86S 


GgSNQSSgSLSLIVTGCSSYWS*IND'r'dl'iLUVLSSNFGRQ*LR 
PFPCSQLPMSQGCLWHLDCCCPWVPyiPGQQWRKGRORMRW*QS 
LLGSDQESVGLEDLCVFVNFLriHVLLGLFP*PHELPLLPVVDLG 
FLFPLLLQGGCHCLVLPANLVSQAPQIGKLSCRIiQTHDIiEGSRN 
HHPLFr*WGRWDAVKHLETVQSGriAS LGFVGQHTSHGPP 


6679 


2 


786 


LEFARGtAMPFLGQDWRSPGQNWVKTVDGWKRFLDEKSGSFVSDL 
SSYCNKEVYNKENLFMSLNYD/SCSQEEKBGHAE ♦QNQNS \DFH 
QEKWIYVHKGSTKERHGYCTLGBAFNRLDPSTAXLDSRRFNYW 
RLI,ELIAKSQLTSI*SGIAQKNPMN1LEKWLKVLEDQQKITLIR 
ELLQ7LYTSLCTLVKRVGKSVLVGNINMWVYRMETILHWQQQLN 
NIQITRVSGQAQPPPGSGSLHRDTGQTRQDFSFTPVTEESGIjP 


6680 


1498 


29S1 


PliCTLPLMPSAIiPGWAGERWEKQWPLA/ PGPGTWQTPVGS ISEE 
P\RKNEPDTHCPRGEARPEV •HLPKPHS PGSEGAE IQTSA*AIiP 
/NQVSPPQPM*GAEENGDQRGGKEKAGSEIiHRSSSGLTAAPGP? 
EVHRNLQTPPGLPSRGGGP/GGAGTQGSWAPGEQPP/SPLIjPAS 
MQRSQAGLPGWEAGLVESPTHinPAIiRPSGTNATGEAPPSTTCS 
SQP\PAPPGPTGLRPGGGSSS6GHG**PGLPVGKV\GALGAAQD 
PQSQGRGPTQGTVGTEMI,LSGLGSAKACPAARPAVP*IJPSDPAS 
TIPKKGTRGFGEGPGVLQERNRWWGRAQGFTSADAAGTAPPGV 
* LPAPIiSQPPGATEPQVRACGMAPPS PGTSGRLVAWGRHPGPQV 
AQGCPPGAGCWGSQPRGSQRCPRTYTHSPLGHGRAPCPRRCWH* 

wqdppssprtgclpgipjuiqaysaprtrsrpgirtgraaygpir 
fqggggg 
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SEQ 
ID 

NTO: 


i'redicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide" 
(A=Alanine, C-Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H^Histidine, l=isoleucine, K=Ijysine, 
L-Iieucine, M=Methionine, N=Asparagine , 
P= Proline, Q=Glut amine, R«>Arginine, 
S=Serine, T=Threonine, V«Valine, 
W«Tryptophan, V^Tyroeine, X=Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6681 


1169 


511 


lNYIYYNQgQRAFHELK\EKLMSAPALGLPDLTKLFTLHVSERE~ 
KMTVGVLTQTVGPWSRPGAYLSKQLDGVSKGWPPCPRALAATAL 
LAQEADELTLRQNLNRKSPHAVWTL.lNTTfnwn*! tmhot T^>^rr\ 

TLLCENPHKTIEVSNT/LNPATLLLVTESPVKHNCLEVLDSVYS 

SRPNLRDHP+TSVDWELYVDGSGFANPCKVTLKKETSPAPVTPR 
S 


6682 


109 


1238 


TVLCGTU-IQVSSEiNEVKlYSLSCGKSLPEWLSDRKKRALQKKDVD 
VRRRIBLIQDFEMPTVCTTIKVSKDGQYILATGTYKPRVRCYDT 
YQLSLKFERCLDSEWrPEILSDDYSKIVFLHNDRYlEFHSQSG 
FYYKTRIPKFGRDFSYHYPSCDLYFVGASSEVYRLNLEQGRYLN 
PLQTnAAENWVCDlNSVHGLFATGTXEGRVECWDPRTRNRVGLL 
D\AP*TVSQQIQR*TSLPriSALKFM\GAIiTMAVRTTTV3n\7T t v 

DLRSDKPLLVKDHQYGLPIKSVHFQDSLDLILSADSRIVKMWNK 
NSGKIFTSLEPEHDLNDVCLYPNSGMLLTANETPfO^^GIYYIPVL 
GPAPRWCSFLDNLTEKLESNPBSKE 


6683 


109 


1238 


TVIiCGAMQVSSLNEVKI YSLSCGK5LPBWT.qr>PK-WPaT nvirnwn — 

VRRRIBLIQDFEMPTVCTTIKVSKDGQYlLATGTYKPRVRCYDT 

YQLSLKFERCLDSEWTFEILSDDYSKIVFLHNDRYIBFHSQSG 

FYYKTRlPKFGRDFSYHYPSCDLYPVGASSEVYRLNIiEQGRYIiN 

PLQTDAAENNVCDINSVHGLFATGTIEGRVECWDPRTRNRVGLL 

D\AP*TVSQQrQR*TSIjPTISALKFN\GALTMAVGTTTGQVLLY 

DLRSDKPLLVKDHQYGLPIKSVHFQDSLDLILSADSRIVKMWNK 

NSGKIFTSLEPEHDIiNDVCLYPNSGMLLTANETPKMGIYYIPVL 

GPAPRWCSFLDNIjTEBLEENPESNE 


6684 


111 


527 


GLRGGTSRGRAGREPEFAAGVLCWAGFCQSPCPPGGRGREAPA 
PP\SGRRHA*RCA*WT.nf2D^2rinc/^r«Dc«n»r»y-if? //^rrtr y-^nnKx-nn-m^M 
^dV3Afum ^rrt^ v^jj(ji3Fi3tjJJboCjKciEGGS/QEIiQRAMESKMG 

ELPLDINIQEPRWDQSTFLGRARHFFTVTDPRNLLLSGAQLEAS 
RNIVQNYR 


6685 


258 


1473 


kllgdnfegfckkfelsdsengsns*qspl\fdrlpdpdpqkvl 

QGVIDMKNAVIGNNKQKANLIVL6AVPRLLYLLQQETSSTELKT 
ECAWIjGSLAMGTENNVKfiTiT.nrMTTTaaT t .tiot t cth-^t vc»-r 

lrclrtiftspvtpeellytdatviphj^mallsrsrytqeyicq 
ifshcckgpdhqtii,fnhgavqniahlltslsykvrmqalkcfs 

VLAFENPQVSIylTLVNVLVI)QELLPQIFVK^4LQRDKPIEMQLTSA 
KCLTYMCRAGAIRTDDNCIVLKTLPCliVRMCSKERLLEERVEGA 
ETIAYLIEPDVELQRIASITDHLIAMLADYFKYPSSVSAITDIK 
RLDHDLKHAHEIjRQAAFKLYASLGANDEDIRKKVSLGEGRPPVIj 
TASRQGVTST 


6686 
6687 


310 


927 


dsvtfddlavdftpkewtlldptqrnlyrdvmlenykniatvgy" 

qlpkpsliswlbqbesrtvqrgdfqasewkvqlktkeialqqdv 

lgeptssgiqmxgshngoevsdvkqcgdvssehsclkthvrtqn 

SENTFECYliYGVDFIiTLHKKTSTGEQRSVFSHVWKKPSSLNPDV 
VCQKNRCTRKKKAF* LQLTLGKSFH* S IHT 




181 


915 


EAMLBAPYKKEEDEQQRKEVKKDYPSNTTSSTSNSGKTETSGSST 
IGBTSNRSRDRDRYRRRNSRSRSPGRQCRHRSRSWDRRHGSESR 
SRDHRREDRVHYRSPPLATGEPVDNLSPEBRDARTVPCMQLAAR 
XRPRDLEDFFSAVGKVRDVRI ISDRNSRRSKGIAYVEFCEIQSV 
PLAIGLTGQRLLGVPIIVQASQAEKNRLAAMANWLQKGNGGPMR 
LYVGSLHFNITEDMLRGIFEPFGKV 


6688 

J 


1025 


1 


AEVPIfYPRVi-HKCPDSCWRFKFQPIQLQPYILLSFiiEKPPiSF 
SEPGLPR/SATARMATAAAPPUSS IDLPSDSGMGFI S PAGDSLD 
LPSDGGTGPPSLAGDSSSTRLSSLAFISFSLSSVSVGSSAGTTS 
STSVGSWAAPTSSSSSSTNRDVAGLDFSTVITSVSGSLVPSRE 
VAVICGSKGAQASGSASCSSRAGKTTEATAASSMPSGTSSFSTC 
rMSELEELFSriPSPAPLLSKLPTSSGSIAICCQDSGPSDTGRLS 
/CQLWIiADSDTGKLSDCQEWTVGDSGGIjTCPELSLGRM*MSLL 



540 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corre spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing sxgnai peptide 

(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 

Glutamic Acid, F= Phenylalanine, G=Glycine, 

H**Hi.stidine, I=Isoleucine/ K=Lysine, 

L=:Leucine, M=Methionine, N=Asparagine, 

P= Proline," Q=Glut amine, H«Arginine, 

S=Serine, T=Threonine, V=Valine, 

W=Tryptophan, YssTvrosin** y— TTi-iVMr\f.m * c?*- 

*-*■'■ y ne , a— unKnown, —Stop 

Codon, /*»possible nucleotide deletion, 

\=possible nucleotide insertion) 


6689 


640 


1299 


SSAVlPGYSSSSDSRLNTVPTVDLLCPFQTKSSf ' 

SSSASYATSATSISDTAFSGSLKLKHGLLSAIiDSSSRtS*STSS 
AEDSTFRICSPSVSDTSSDSSGSKDNVLILPSKVSI*SCFSLSS 
FFSDSISFCFSSSSFCKR*FVSSKVSQNALLSSRLSNGPGGSSK 
QRNSIiTARQLAMSL*ATKF*RNACNPNCLSSKKSAL*LSLNQRF 
GGSASRKPGNISFKSQKCSALSYCCNFVIKPREVSVSSENYPAF 


S690 


1 


442 


Li J. ittj AiviftA 1 litrPliGS WQQWR R^lja ARDGSRMLLLLLLLGS GQGP 

QQVGAGQTFEYLKREHSLSKPYQGVGTGSSSIiWNLMGNAMVMTQ 

YIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKrHGQGKKNL\H 
GDGLAIWYTKDRMQP 


S691 
6S92 


2Q7 


1401 


LKTETSEEKARRYKDRPSQLNAVFQEQKKMIQAQESITLBDVAV 
DFTWSEWQLLGAAQKDLYRDVMLENYSNIiVAVGYQASKPDALFK 
liEQGEQIiWTIEDGIHSGACSDIWKVDHVLBRLQSESliVNRRKPC 
HBHDAFENlVHCSKSQFIiliGQNHDIFDLRGKSLKSNLTLVMQSK 
GYEIKNSVEPTGNGDSFLHANHERLHTAIKPPASQKLISTKSQF 
ISPKHQKTRKLEKHHVCSEOSKAFIKKSWLTDHQVMHTGEKPHR 
CSLCEKAPSRKFMLTEHQRTHTGEKPYECPECGKAFLKKSRLNI 
HQKTHTGEKPYICSECGKGFIQKGNLIVHQRIHTGBKPYICNEC 
/GKGFIQKTCLIAIIQHPllTER 




178 


939 


WIKEGELSabWERFCANIIKAGPMPKHIAFIMDGNRRYAKKCQVE 
RQEGHSQGPNKLAETIiRWCLNLGILEVTVYAFSIENFKRSKSEV 
IX5LMD1ARQKFSRI>5EEKEKI^KHGVCIRVLGDLHLLPLDLQEL 
lAQAVQATKNYNKCFLNVCFAYTSRHEISNAVREMAWGVEQGLL 
DPSDISESLLDKCLYTWRSPHPDILIRTSGEVRLSDFLLWQTSH 
SGLVFQPVLWPEYTFWNLFEAILQFQMNHSVLQK 


6693 


178 


939 


WIKEGELSLWERFCAKIIKAGPMPJCHIAFIIvnDGNRRYAKKCQVE" 

RQEGHSQGFNKLAETLRWCLNLGIIiEVTVYAFSIEWFKRSKSEV 

IXSLMDLARQKFSRLMEEKEKLQKHGVCIRVLGDLHLLPLDLQEL 

IAQAVQATKNYNKCFI.NVCFAYTSRHEISNAVREMAWGVEQGLL 

DPSDISESLLDKCLYTNRSPHPDILIRTSGEVRLSDPLLWQTSH 

SCLVFQPVLWPEYTPPfNLFEAILQFQMNHSVLQK 


6634 


292 


813 


SLLLHIAPPCSAYTPSQPLSSVSTETASSVRRQAAESRQHELPVR 
o V no ijuux Jjirv wijTAEAePPEAQDP WGSPG I SLPAAH I G FAAA 
LAVGPSGCHTEP\PDEVWPSLPLGDAYAARDKSKI*IQLGITHW 
NAAAGKPQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLP 


6695 
€ 69^ 


292 


813 


SLLLHIiAPPGAYTPSQPLSSVSTETASSVRRQAAESRQHELPVR 
EVHSLGQILPQDGLTAEAGPPEAQDPWGSPGISLPAAHIGFAAA 
lAVGPSGCHTEP\FDEVWPSLFIiGDAYAARDKSKLIQLGITHW 
NAAAGKPQVDTGAKFYRGMSLEYYGIEADDNPFFDLSVYFLP 




1 


782 


PRVRGRVGERWAPLSVPAAMSSEMEPLUjAWSYFRRRKFQLCAD 
LCTQMLEKSPYI>QAAWILKARAtiTEMVYIDEIDVDQSGIAEMML 
DEKIAIAQVPRPGTSLKLPGTNQTGGPSQAVRPITQAGRPITGPL 
RPSTQSGRPGTMEQAIRTPRTAYTARPITSSSGRFVRLGTASML 
TSPDGPFINLSRLNLTKYSQKPKrAKALIEYIFHHENDVKTALD 

laalstehsqykdwwwk/bqiekcyyrvgmyreaekqikss 


6698 


3 
668 


782 
754 


PPIiFLRRLNSRALRPGSRXVMAWPASLSGQDVGSFAYLTiKDR 
IPQILTKVIiyrLHRHKSBFFEKHGEEGVEAEKKAISLLSKLRWE 

lqtdkpfiplvekfvdtdiwnqyleyqqsllnesdgksrwfysp 

WLLV\ECYMYRRIHEAI\IQSPPIDYFDVPKESKEQKrFYGSQES 
IIALCTHLQQLIRTIEDLD\ENQLKDEFFKLLQISLWGEISVDL 
SIj\SGGESSSQNTNVLNSLEDLKPFILLNDMEHLWSLLSNCK 


£699 


325 


492 


VGSCACAGSCKCKECKCTSCKKSECRAFP 

EGELP/PARRVLPRAMTASAQPRGRRPGVGVGWVTSCKHPRCV 
LLGKRKGSVGAGSFQLPGGHLEPGSTWEECAQRETWEEAALHLK 
CmiFASVVNSFIBKENYHYVTILMKGBVDVTHDSEPKMVEPEKir 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide"" 
(A^Alanine, Cc=Cvsteine D-Asoa-rti f ar'-tr? p— 
Glutamic Acid, F« Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
lj=Leucine, M^Methionine, N=Asparagine, 
P- Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=:Threonine, Va Valine, 
W^Tryptophan, Y=Tyroaine, X-Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








ES KRII YNHAFFFQBSKWSGG I LQ 


€700 


1098 


1392 


TQCWRSSTPGMRTHFRTQP/RLECXJQGFSQQENGHCMDTNECIQ 
FPFVCPRDKPVCVNTYGSYRCRTNKKCSRGYEPWEDGTACVERT 


6701 


2 


1485 


AAAGPRTRVRRAAAFEGQPSPSPGLGPTSDKAAAPRTPKRRRLW " 

RQRQ/HPAMLCYVTRPDAVLMEVEVEAKANGEDCLNQVCRRLGI 

lEVDYFGLQFTGSKGESLWLNLRNRISQQMDGIAPYRLKLRVKF 

FVEPHLILQEQTRHIFFLHIKEALIiAGHIiLCSPEQAVELSAUA 

QTKFGDYNQNTAKYNYEELCAKELSSATLNSIVAKHKELEGTSQ 

ASAEYQVLQIVSAMENYGIEWHSVRDSEGQKLLIGVGPEGISIC 

KDDFSPINRIAYPWQMATQSGKNVYLTVTKESGNSIVLLFKMI 

STRAASGIiYRAITETHAPYRCDTVTSAVMMQYSRDLKGHLASLF 

LNENINLGKKYVFDI KRTSKEVYDHARRALYNAGWDLVSRWNQ 

SPSHSPLKSSESSMNCSSCEGLSCQQTRVLQBKLRKtiKEAMLCM 

VCCEEEIMSTPCPOGHTVCCESCAAQLQVGESAAHFCLQPHLSL 

LLTOS R S OVIAR 

W X w 0 A i9 u V Joruv 


6702 


397 


1971 


PLAKFLKIiDDVWI,CLPMEDVFt>pyRTCiii^CSMGLd6SCHLSl.PK" 

RAEALLCSRKATWRDLVAVRMAEEQE FTQLCKLPAQPSHPHCV 

NNTYRSAQHSQALLRGLLALRDSGILFDXA/LWEGRHIEAHRIL 

LAASCDYPKGMFAGGLKEMEQEEVLIHGVSYNAMCQILHFIYTS 

ELELSLSNVQETLVAACQLQIPEIIHFCCDFLMSMVDEENILDV 

YRriABLFDLSRLTEQLDTYILKNFVAFSRTDKYRQLPLEKVYSti 

LSSNRLEVSCETEV YEGALLYHYSLEQ VQADQI S I>HE PPKLLET 

VRFPLMEAEVLQRLHDKLDPSPLRDTVASALMYHRNESLQPSLQ 

SPQTELRSDFQCWGFGGIHSTPS\MSSATRPKYLNPLLGEWKH 

If lASJjAPRMSNQGIAVLNNFVYLlGGDKX^VQGFRAESR 

RHNRWPQIQSLQQEHADLSVCWGRYIYAVAGRDYHNDLNAVER 

YDPATWSWAYVAPI»KREVYAHAaATLEGKMYITCGRKGRIT 


6703 


45 


1244 


GVGPRAAAMPLELELCPGRWVGGQHPCFIIAEIGQKHQGDLDVA " 

KRMIRMAKECGADCAKFQKSELEFKFNRKALERPYTSKHSKGKT 

YGEHKRHLEFSHDQYREljQRYAEEVGIPFTASGMDEMAVEPLHE 

LNVPFFKVGSGDTNNPPYLEKTAK/TRGWHSVLRDVOSVQLNDE 

TSSWDVLGRVRTSKEKVLMVLVLDYSGRPMVISSGMQSMDTMKQ 

V i u J. vkpjjNPNFCFLQCTSAYPLQPEDVNLRVISEYQKLFPDIP 

IGYSGHETGIAISVAAVALGAKVLERHITLDKTWKGSDHSASLE 

PGEIAELVRSVRLVERALGSPTKQLLPCEMACNEKLGKSWAKV 

KIPEGTILTMDMLTVKYGEPKGYPPEDIFNLVGKKVLVTVEEDD 

TIMES 


6704 


82 


1007 


TMNTRNRWNSGIiGASPASRPTRDPQDPSGRQGELSPVEDQREG 

LANLEKWKEQNRAKPVHLVPRRIiGGSQSETEVRQKQQIiQLMQSK 
YKQKLKREESVRIKKBAEBAELQKMKAIQREKSNKLEEKKRLQB 
NLRREAFREHQQYKTAEFL/RQTEHRIARQKCLSKGCLWPTILN 
MGQKLGLQ\DSLKABBNRKLQKMKDEQHQKSELLELKRQQQEQE 
RAKIHQTEHRRVNNAFIiDRLQGKSQPGGLEQSGGCWWMKrSGNSW 
GI 


6705 


2 


786 


RJUCRNSARVPCGWSASRSLGEGAGFIGPLRGPHPRAGGTGTSFT 
SYKRKGGIMSTIAAFYGGKSILITVATGFLGKELMEKLFRTSPD 
LKVIYILVRPKAGQTLQHRVPQILDSKLFEKVIEVRPNVHEKIR 
AIYADLNQNDFAISKEDMQELLSCTWIIFHCAATVRFDDTLRHA 
VQIiNVTATRQLLLMASQHPKLBAFIHISTAYSNCNLKHlDEVIY 
PCPVEPKKIIDSIiEW\LDnAIIDEITPKLIRDWPNIYTYTK 


6706 


130 


b3l 


PTHSSSSHSQEMLGECLNMLRNDGHFCDITIRVQDKI FRAHKWL 
AACSDFFRTKLVGQAEDENKNVLDLHHVTVTGFIPLLEYAYTAT 
r.S 2NTEN1 IDVIiAAAS YMQMPSVASTCSEFMKSSIIiWNTPNSQP 
EK 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

Cor re spending 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 

nucleotide 

location 

CO r r e spondi ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Amino acid seyment containing signal peptide" 
(A=*Alanine, C=Cysteine, D=Aspartic Acid, 
Glutamic Acid, F-Phcnylalanine, G=Glycine, 
HsHistidine, I=Isoleucine, K-Lysine, 
L=Iieucine, M=Methionine, N=Asparagine , 
P=Proline, Q^Glutamine, R=Arginine, 
S=Se rine, T=Threonine, V= Valine, 
W«Tryptophan, Y=Tyrosine, X-Unknovm, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6707 


2233 


1343 


YWSQiaYELQHFHWRKPHFEKKGPPSTCQBRLYESRSRWPCIS* 
GMVWGWTAVNGSW*GGQLRCVCVCTSHSSDSTRSSQRASKCHS 
FFILSQ*KT*SSWENWVFAKYSRiysyGHSCSKGRGD*DFK*Prv 
SQAR*SRFCGLCNPCGHCGLDINLRGGSSPWTDKHSCVHNNLLC 
NRRVFSLLCEGPGHCYQGAVCREACAAASPGLDSAAEPHRLCEH 
TD*LPK*GPGYIQHFHCDSNILCILYNfISFNLFSYSF*GVARYA 
C*RCHWYFEWLLYNHCGDILVACL*RRQL*SSQ 


S 708 


115 . 


1729 


TVGSWSRSGRSPPVGRQLLLTGRGAQAAGSPQGGMALQVELVPT 
GEIIRWHPHRPCiOALGSDGVRVTMESALTARDRVGVQDFVLi 
ENFTSEAAFIENLRRRFRENLIYTYIGPVLVSVNPYRDT.QiySR 
QHMERYRGVSFYEEPPHLLAVADTVYRALRTERRDQAVMISVES 
GAGKTDATKRLLQLYAETCPAPQRGGAVRDRLLQSNPVLEAFGN 
AKTLRNDNSSRFGKYMDVOPDFKGAPVGGKILSYLLEKSRWHQ 
NHGERNFHIFYQLLEGGEEETLRRLGLERNPQSYbYLVKGQCAK 
VflSINDKSDWKWRKALTVIDFTEDEVEDLLSXAASVLHLGNlH 
FAANEESNAQVTTEKQLKYLTRLLSVEGSTLREAIjIHRKIIAKG 
EELLSPLNLEQAAYARDAIxAKAVYSRTFTWLVGKlNRSLASKDV 
ES PSWRSTTVLGLLD 1 YGFBVFQHWSFEQFCINYCNEKLQQLF I 
ELTLKSEQEEYEAEG IAWEPVQYFNNKIICDI.VESKFKGr I\SI 
LDE\ECLRPGE 


6709 
~ 6710 


3 


894 


PPHEHLFPSGERGPFSFLVSRRGLGPGKMGKKGKKEKKGRGAE~ 
TAAKMBKKVSKRSRKEEEDLEALIAHPQTLDAKRTQTVELPCPP 
PSPRI^NASLSVHPEKDELILPGGEYFNGQKTFIiYNELYVYNIRK 
DTWTKVDI PSPP PRRCAHQAWVPQGGGQLWVFGGEFASPNGEQ 
FYHYKDLWVLHLATKTWEQVKSTGGPSGRSGHRMVAWKRQLILF 
GGFHESTRDYI y YNDVYAFNLDTFT;ys KLSPSGTGPTPRSGCQX 
I PSLPRAASSVYGGYSKQRVKKDVDKGTRHSDMP 


" ^711 


158 


980 


RHKMTWYRVES SSGRAARKMREALMGPAFI7\AIG Y IDPGNFATN 
IQAGASPGYQLLWVWWANI.MAMLIQILSAKLGIATGKNI1AEQI 
RDHYPRPWWPYWVQABIIAMATDLAEFIGAAIGFKLILGVSrjC 
QGAVLTGIATFLILMLQRRGQKPLEKVIGGLLLFVAAAYIVELI 
FSQPNIiAQr,GKGMVrPSLPrSEAVFLAAGVL\GATIMPHVI/YI 

WHSSLTQHLHGGSRQQRYSATKWDVAIAMTIAGFVNIAIMATAA 
SELNFYGHTGVA 




3 


347 


VTECKTMTCKMSQLERNI*TMINTLHHYSVKLGHPDTLIHG5PK 
ELVRTDrjlNILM:<ENKNDQAI*HIMEDLDrmAHMQIIFKELIML 
MAMLTW3YHDNMHDADYGPGQQHRPG 


6712 


118 


578 


PHGQKRTRYPQVRAPGQQPQAQIAMALCLKQVFAKDKTFRPRKR" 
FEPGTQRFELYKKACjaSLKSGLDLRSWRriPPGEWIDDWIAVHV 
VDFPNRINLIYGTMAERCS*TSCPVMAGGPRYEYRWQDERQYRR 
PAKLSAPRYMALLMDW I ESL I 


6713 


2485 


3 


QARGSDSEDGEFEIQAEDDARARKLGPGRPLPTPPTSECTSDVE 
PDTREMVRAQNKKKKKSGGPQSMGLSYPVPKGIMKKGYKVPTPI 
Urtix. J. xjTv j.uu(jruj WMTO/iK 1T,3oIjK.TACFIjIjPMFERIiKTHSAQTG 

aralilsptrelalqtlkftkeiigkftglktalilggdrmedqf 
aalhenpdiiiatpgrlvhvavkmslklqsveywfdeadrlfb 
mgfaeqlqei lari .pgghqtvlfsatlpkli.veparagltepvl 
irldvdtklkeqlktsfflvredtkaavllhllhnwrpqdqtv 

VFVATKHHAEYLTELLTTQRVSCAHIYSALDPTARKINLAKFTL 
QKCSTLIVTDIiAARGLDIPLLDNVINYSFPAKGKLFLHRVQRVA 
RAGRSGTAYSLVAPDEIPYLLDLHLFLGRSLTIiARPriKEPSGVA 
GVDGMLGRVPQSWDEEDSGLQSTLBASLELRGLARVADNAQQQ 
YVRSRPAPSPESIECRAKEMDLVGIiGLHPLFSSRPEEEELQRLRli 
VDS I KNYRSRATI FEINASSRDLCSQVMRAKRQKDRECAIARFQQ 
SQQGRQEQQEGPVGPAPSRPALQEKQPEKEEEEEAGESVEDIFS 
BWGRKRQRSGPNRGAKRRREEARQRDQBPyiPYRPKDFDSBRG 
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SEQ 
' ID 
No : 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end" 

nucleotide 

location 

c or r e spond i ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Ammo acid segment containing signal peptide 
(A=7aanine, C=Cysteine, D^Aspartic Acid E- 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
H=Histidine, 1=1 so leucine, K=Lysine, 
L=rLeucine, M==Methionine, Kr=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S«Serine, T=Threonine, V« Valine, 
W=:Tryptophan, Y=Tyrosine, X=Unknown, *-Stop 
Codon, /^possible nucleotide deletion, 
\«»po3sible nucleotide insertion) 










LS ISGEGGAFEQQAAGAVLDLMGDEAQNIiTRGRQQLKWDRFCkKR 
FVGQSGQEDKKKIKTESGRYISSSYKRDriYOlfW5fnirnvTn*Q*Y 

GRRRGILTjRRRPRTEEVGEARPLAQAGCIPGPHAPRHPLQAESA 
LELKTKQQ ILKQRRRAQKAALSLQRWWPQAALCPQ 




S714 
S715 


169 


1416 


wncqellppppapmahipsggapaagaapmgpqycvckvelsvs 

GQNLLDRD VTS KSDP FCVLFTENKGRW IE YDR tttti ttvtmt Kroa w 

SKKFVLDYHFEEVQKLKFALFDQDKSSMRLDEHDFLGQFSCSIiG 
TIVSSKKITRPLLLtiNDKPAGKGLrTIAAOELSnMRVTTT qt 

RRLDKKDLFGKSDPFLEFYKPGDDGKWMLVHRTEVIKYTLDPVW 
KPFTVPLVSLCDGDMEKPIQVKCYDYDNDGGHDPIGEPQTSVSQ 
MCEARDSVPLEFECINPKKQRKKKNYKNSGIIILRSCKIMRDYS 
FLDYrLGGCQLMFTVGIDPTASNGNPLDPSSLHYXNPMGTNEYL 
SAIWAVGQIIQDYDSDKMFPALGFGAQLPPDVJKVSHEFAINFNP 
TNPPCSGVDGIAQAYSACLP 




6716 


32 


493 


^^i'AGAESGSLHCLPATVQAIiAGAAHSPHGGQPPRRGPI.IGSGMP 
GKPKHLGVPNGRMVLAVSDGELSSTTGPQGQGEGRGSSLSIHSL 
PSGPSSPFPTEEQPVASWALSPBRH.QDPLGIAYPTEFI,KKEPS 
AENVTFWKACERFQQIPASDT 




6717 




176 


GAGGPAPRSFGSEEPRAALBRDKMSARAAAAKSTAMEETAIWEQ 
HTVTUIRVSLCCSK 


6718 


lis 


836 


LFAMSGFEtJIxNTDFYQTSYSIDDQSQQSYDYGGSGGPYSKQYAG 
YDYSQQGRFVPPDMMQPQQPYTGQIYQPTQAYTPASPQPFYGNN 
FEDEPPr*LBELGI WFDHI WQKTLTVLHPLKVADGS TMNIRTriT 
PMVFCLAFGATLLLAGKIQFG YVYGI SAIGCLGMPCL»LNLMSMT 
GVSFGCVASVLGYCLLPMILLSSPAVIFSLQGMVGIILTAGIIG 
WCSFSASKlFISALAMEGQQliliVAYPCALLYGVFALISVF 




290 


S99 


KQSSTVPGTILPSLKWHNSGLCKFPETGGKMTTFKEGLTFKDVA 
VIFTEEELGLriDPVQRNLYQDVMLENFRNLIiSVGHHPFKHDVFL 
LEKEKKLDIMKTATQ 


6719 
6720 


1 


691 


PTRPEBQDREDGKCHIOaBMHPTgr!^^''wnnpTaMo^r^o^gl^r^J^^ — 

DLDSr)DDKIEiCPWNPMKDSASQDNGI,SRKISRKRVCSSDSDSSL 
QWKKSSKARTGLLRITRRCAATAANKIKLMSDVEDVSLENVHT 
RSKNGRKKPLHLACTTAKKKLSDCEGSVHCEVPSEQYACEGKPP 

DPDSEGSTKVIiSQALNGDSDSEDMLNSEHKHRHTNIHKIDAPSK 
RKSSSVTSSG 




3 


822 


HBVAEEAGGTVYPQRGTMPGXKRFQHVIETPEPGKljfEitjTGYEAA" 
VPITEKSNPLTQDLDKADAENIVRLLGOCnaETPrjPRrinziT ofpv 

QRLYSESILTTMVQVAGKVQEVLKEPDGGI,WLSGGOTSGRHAF 

LMS VS FNQLMKGLGQKPLYTYLIAGGDRSWASREGTEDSALHQ 

lEELKKVAAGKKRVIVIGISVGLSAPFVAGQMDCCMNNTAVFIiP 

VLVGFNPVSMARHPFPPPRILRSLTVFPSLRAPHYOtTSLLFSM 
SWTIilSE 


6721 


3 


822 


HBVAEEAGGTVYPQRGTMPGTKRFQHVIETPEPdKWELTGYEAA 
V f X J. ^.i^^iMFij f QDIjOKAJjAENIVRLiLGQCDAEI fqeegqalsty 

qrlysesllttmvqvagkvqeviikepdgglwlsgggtsgrmap 
lmsvsfnqlmkglgqkplytyliaggdrswasregtedsalhg 

IEE3LKKVAAGKKRVIVIGISVGLSAPFVAGQMDCCMNNTAVFLP 

VLVGPNPVSMARHPFPPPRILRSLTVFPSLRAPHYOirSIiLFSM 
SWTLISE 


6722 
672i 


1 


390 


RSWSKRTWQALfMAVLiFLLLFLCGTPQAADNMQAIYVAL^SEAV^ 

LPCPSPSTLHGDEHLSWPCSPAAGSFTTLVAQVQVGRPAPDPGK 

PGRESRLRLLGNYSLWLEGSKEEDAGRYWCAVLGQHHNYQNW 




173 


659 

( 
3 


rcQYCTARMADFGISAGQFVAWWDKSSPVEALKGLVDKLQALT 
SNEGRVSVENIKQLLQSAHKBSSFDIILSGLVPGSTTLHSAEIL 
\EIARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALTLSGL 
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SSQ 
ID 
WO: 


1 Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F^Phenylalanine, G:=Glycine, 
H=Histidine, I=Isoleucine^ K^Lysine, 
L^Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q^Glutamine, R«Arginine, 
S^Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X=Unknown, ♦^Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6724 
6725 


173 


659 


VEVKELQREPLTPEEVOSVREHLGHESDNL 

VCQYCTAHMADFGISAGQFVAWWDKSSPVEAI.KGLVDKLQALT 
GNEGRVSVENIKQLLQSAHKESSFDIILSGLVPGSTTLHSAEIL 
AE I ARILRPGGCLFLKEPVETAVDNNSKVKTASKLCSALTLSGL 
VEVKELQREPLTPEEVQSVREHLGHESDNL 




356 


722 


RRRTPPVHiATMDDDLMLALRLQEEWNLQEAERDHA;t5ESLS]:>VD 
ASWELVDPTPDLQALFVQFNDQFFWGQLEAVEVKWSVRMTLCAG 
ICSyEGKGGMCSIRLSEPLLKItRPRKDLVEVPFV 


6726 
6727 


98 


714 


HX.gKMERKlJaRREKEKEYEGKHNSLEDTDQGKNCKSTLMT"OJVG 
GYLYITQKQTLTKyPDTFLEGIVNGKILCPFDADGHYPIDRDGL 

LPRHVIiNFLRNGELLLPEGFRENQLIAQEAEFFQLKGLAEEVKS 
RWEKEQLTPRETTPLE ITDNHDRSOfiTiP T PrMRPnPT'cirT b-co t 

VLVSKSRLDGFPEEPSISSNIIQFKYFIK 


6728 


1 


831 


rRGMGDERPHYYGKHGTPQKYDPTFKGPIYNRGCTDIICCVFLL 
lAI VGYVAVGI lAWTHGDPRKVI YPTDSRGEFCGQKGTIOTENKP 
YLFYPNIVKCASPLVLLEFQCPTPQICVEKCPDRYLTYLNARSS 
RDFHYYKQPCVPGFKNNKGVAEVLRDGDCPAVLIPSKPIiARRCF 
PAIHAYKGVLMVGNETTYEtKSHGSRKNITDtiVEGAKKANGVLEA 
RQLAMRIFBDYTVSWYMDIISLG1AMAMSLI.FIILLRFLAGIMG 
RGMIIMGILVliGY 






935 


FCSSWLRSX^DSSLSWKMFIiVGLTGGIASGKSSVIQVFQQLGCA 
VlDVDVMARHVVOPGYPAitHI7TUT?VPri'rc^7T t T?vTr«ptTivm vitt ^.t^ 

LIFNQPDRRQLLNAITHPEIRKEMMKETFKYFIiREPRTSPRQKK 

hvpsalkeadslmrrdt 


6729 
6730 


259 


1191 


VGi.TGAQSGKTASMGRDQRAVAGPAI»RRWIiLLGTVTVGFliAQSV^ 
LAGVKKPDVPCGGRDCSGG CO CYPEKGGRGO pa P vn xsnr* V wnn T> 

GLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGI 
PGHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGPTGPPGPQG 
PKGQKGBPYALPKEERDRYRGEPGEPGLVGFQGPPGRPGHVGQM 
GPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKGEKGDVGQPGPN 
GIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGSE 


6731 


784 


1015 


NMVDYYEVLGI>QRYASPEDIKKAYHKVALKWHPDKNPENKEEAE 
RKFKEVAEAYEVLSNDEKRDI YDKYGTEGIiNE F 


6732 


1 


446 


eiRKRLHGAWPRVEVGCPWETRESEGVHLERPTSPLKNNDEGS 

LDI YAGLDSAVSDSASKS CVPSRWCLDLYEE ILTEEGTAKEATY 

NDLQVEYGKCQLQMKELMKKFKEIQTQNFSLINENQSLKKNXSA 
LlKTARVEtNRKDSEt 


6733 


102 


1205 


GRWQRRPPPPSPPLWCLQPGGGSDPQQLTQLRHCLSHSPQDTPW 
AQRQVCYTAATTQAAAPATRKCLPDKSGHRPTPPRSHRHHRQEN 
LGS IKPSSRSTKATSTnviAGDGRRAEAVREGWGVYVTPRAPlRE 
GRGRLAPQNGGSSDAPAYRTPPSRQGRREVRFSDEPPSVYGDFE 
PLVAKERSPVGKRTRLEEFRSDSAKEEVRESAYY1,RSRQRRQPR 
PQETEEMKTRRTTRLQQQHSEQPPIiQPSPVMTRRGLRDSHSSEE 
DEASSQTDLSQTlSKKnVRSIQEAPAVSEDLVIRLRRPPLRYPR 

YEATSVQQKVNPSEEGETEEDDQDSSHSSVTTVKARSRDSDESG 
DKTTRSSSQYIESFW 


6734 


613 


1311 


RSCRQVGMRSRNQGGESASDGHISCPKPSIIGNAGEKSLSEDAK 
KKKKSNRKEDDVMASGTVKRHLKTSGECERKTKKSLELSKEDLI 
QLLSIMEGELQAREDVIHMLKTEKTKPEVLEAHYGSAEPEKVLR 
VLHRDAILAQEKSIGEDVYEKPISELDRLEEKQKETYRRMLEQL 
LIAEKCHRRTVYELERrEKHKHTDYMtTKSDDFTNLLEQERBRLKK 
[iLEQEKAYQARKE 




189 


551 

3 


SAAMFPVFSGUfgELQEKNKSLELVSFEEVAVHFTWEEWQDLDD 
tVQRTLYRDVMliETYSSLVSLGHCITKPEMIFKLEQGAEPWIVEE 
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SEQ 
ID 


Preaicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acia segment containing signal peptl-^e" 
(A=Aaanine, C=Cy3teine, D^Asnartie Aniri v 
Glutamic Acid, P=Phenyl alanine, G=Glycine, 
H=Histxdine, I^'Isoleucine, K==Lysine 
L==Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, X=UnkAown, *«Stop 
Codon, ./-possible nucleotide deletion, 
\=^pos3ible nucleotide insertion) 


6735 
S736 


280 


558 


TLNLRLSGuaiCKQVFSGICHRSLVELQEVHLV 

KSRRAGVT^SNPFLKQVFNKDKTFRPKRKVKyGTQRPELHKKA 
Q^^AGLDLRLAVQLPPGEDLNDWVAVHWDPFNRVNLIYGTI 


5737 


195 


808 


SSSNPYLIRRIEELNKTASGNVEAKWCFYRRRDISNTLIMLAD 

KHAKEIEEESETTVEADLTDKQKHQLKHRELFLSRQYESt.PATH 

IRGKCSVALLNETESVLSYLDKEDTFFYSLVYDPSLKTLLADKG 
EIRVGPRYQADIPEMLLEGTFFCVPAVL 


6738 


ISO 


1209 


PVlMPJaHFSPGmVRPSCCVSSSPKLRRKAHSHX^SYRPDTDLS 
REDTGCNLQHISDRENIDDLNMEFNPSDHPRASXIFLSKSQTDV 
REKRKSLFINHHPPGQIARKYSSCSTIFIiDDSTVSQPNLKYTIK 
CVALAIYYHI KNRDPDGRMLLDIFDENLHPLSKSEVPPDYDKHN 
PEQKQIYRFVRTLFSAAOLTAFriiTVTT \nrr TrarTw^T^^^-^^^ 
•K, It— **v* » ^'k-.t jji-isrirt^iji/^v^/tj. V iJbViijERLiLTyAEIDlCP 

ANWKRIVLGAIUj^KVWDDQAVVraTOYCQII^ITVEDMKrEi:^ 
RQFLELLQFNINVPSSVYAKYYFDLRSLAEANHLSFPLEPLSRE 
RAHKLEAISRI^CEDKYKDLRRSARKRSASADNLrLPRWSPAIIS 


6733 


148 


6b3 


CACAEQPAkAiSVGAATALPVRWASGEMAPSGSLAVPLAVriVLLr, 
WGAPWTHGRRSNVRVI TDEHWREritjPfinMM t pwva n^r^ny. /-.^vt, 

QPEWBSPAEWGBDLEVNIAKVDVTEQPGLSGRFIITALPTIYHC 
KDGEFRRYQGPRTKKDFINFISDKEWKSIEPVSSWF 




3 


631 


SWPWiHCj^iSVAKI.EKHLMt^RQEYVKLQKKIAETEKI^CALL^ 
ANKESSSESFISRLLAIVADLYEQEQYSDLKIKVGDRHISAHKP 
VLAARSDSWSIANLSSTKELDLSDANPEVTMTMLRWIYTDELEF 
REDDVFLTELMKIANRFQLQLLRERCEKGVMSLVNVRNCIRPYO 


5741 


3 


631 


SWPDMAEEBVAM.i!;KHi^LLRQEYVKI.QkKlJU£TEKRCAlJ>AAQ^ 

ANKESSSESFISRLIAIVADLYEQEQYSDLKIKVGDRHISAHKF 

VLAARSDSWSLANLSSTKELDLSDANPEVTMTMUewIYTDELEF 

REDDVFI.TELMKLANRPQLQLLRERCEKGVMSLVNVRNCIRFYQ 

TAEELNASTLMKTYCAEIIASHWVSEVEGVNKAL 


6 742 


141 


9^0 


PLTLPPSSRAi^HTMNTSPGTVGSDPVILATAGYDHTVRFWQA 
n^yj J. V. J. K i w wnyusgvwADEVTPDRSMIAAAVQ P VSLG YQHIRM 
YDLNSNNPNPIISYDGVNKNrASVGFHSDGRWMYTGGEDCTARI 
WDLRSRNLQCQRIFQVNAPINCVCLHPNQAELIVGDQSGAIHIW 

DLKTDHNEQLIPBPEVSITSAHIDPDASYMAAVNSTLVPFSCLL 
P^^lC31LQE!GEFESliARRGT J.T?T.ar«or'Kir'v\TT.TKTT frv^r^-r^-r%<n.M.^ 
^ jii3iu-i«Ki,?ijjji*- ijAL.ytflNCYVWNIjTGGIGDEVTO 
LIPKTKIP 




6743 


141 


960 


PbTLPFSSRAKAGHTMNTSPGTVGSDPVIIATAGYDHTVRFWQA 
HSGICTRTVQHQDSQVNALEVTPDRSMIAAAVQPVSLGYQHIRM 
YDLNSNNPNPIlSYDGVNKIiriASVGFHEDGRWWYTGGEDCTARI 
WDLRSRNLQCQRIFQVNAPINCVCLHPHQAELIVGDQSGAIHIW 
DLKTDHNEQLlPEPEVSlTSAHIDPDASYMAAVNSTLVPPSCLIi 
PIAIGILQEGEFESLARRGLLFLACQGNCYVWNLTGGIGDEVTQ 




6744 


1 


412 


MHSTQDKSLHl^iSGDPNPSAAPTSTCAPRKMPKRISISKQIASVK 

ALRKCSDLEKAIATTALIFRWSSDSDGKLEKAIAKDIiLQTQFRN 

FAEGQETKPKYREILSELDEHTENKU^FEDFMILLLSITVMSDIi 
LQNIR • 






95 


1343 . ] 
I 
I 
1 
\ 

r 


^TPARWHCAGCKVLSRFSSPNKASSFALQSAGGGLPAVRALRRD 
iQKVSTVOYGMDEVEQDQHEARLKELFDSPDTTGTGSLGQEELT 
^LCHMLSLEEVAPVLQQTLLQDNLLGRVHPDQPKEAIiILILSRT 
iSNEEHFQEPDCSLEAQPKYVRGGKRYGRRSLPEFQESVEEFPE 
rTVlEPLDBEARPSHIPAGDCSEHWKTQRSEEYEAEGQLRFWNP 
>DLrf7VSQSGSSPP0DWIEEKLQEVCEDLGXTRDGHLWRK?CLVSI 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end" 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
* ^=v-yfi»t.cine, ju— Asparcic Acicl/ E= 
Glutamic Acid, F-Phenylalanine, G=^Glycine, 
H=Histidine, Islsoleucine, K^hysine, ' 
L=sLeucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R^Arginine, 
S=Serine, TaThreonine, V= Valine, 
WsTryptophan, Y=Tyrosine, X^Unknown, *=stop 
Codon, /«poBsible nucleotide deletion, 
\=pos3ible nucleotide insertion) 








CBQYGIiQN VDGBMIiEEVFENLDPDGTMS VEDFFYGLFKNGKS LT " 
PSASTPYRQLKRHLSMQSFDESGRRTTTSSAMTSTIGPRVFSCL 
DDG^5GHASVERILDTWQEEGIENSQE1I,KALDFGLDGNINLTEL 
TIiALEWELLVTKNS IHQACI 


6745 


1 


588 


TFRDQGWAQRRRWLLGCASWESWEAAIAAGPGLPSSTARQQNNP 
AAGTEC FAAVWARGTAMGS VLS TDS GKSAPR <; aTaw a t irw b o n d 

ELPVTSFDCAVCLEVIiHQPVRTRCGHVFCRSCIATSLKNNKWTC 
PYCRAYLPSEGVPATDVAKRMKSEYKNCAECDTLVCLSEMRAHI 
RTCQKYIDKYGPLQELEETA 


6146 


110 


492 


GATGAMAESAPARHRRKRRSTPLTSSTLPSQATEKSSYFQTTEI " 
^i^w^j. w v*t**xy*ivBjvju*iiiay>wu<jjybJjJiOKiT^iAEKKIiADCEKMA 
VEFGNQLEGXWAVLGTLLQEYGLLQRRIiBNVENLLRWRN 


S747 " 


24 7 


484 


EAVTFKDVAWFTEEELGIiLDLAQRKLYRDVMIiENFRWLLSVGH^ 
QPFHRDTFHFLREEKFWMMDIATQREGNSVYAGVC 


S748 


201 


665 


MTT F KEAVTF KDVA WFTEEELGLLDP AQRKL YRDVMtSNFRNL 
LSVGNQPFHQDTFHFLGKEKFWKMKTTSQREGNSGGKIQIEMET 
VPEAGPHEEWSCQQIWEQIASDLTRSQNSIRNSSQFFKEGDVPC 
Q I EARLS I SX VQQXP YRCNBCKQ 


6749 


95 


719 


RREVKG(3DGVCPRARGSPQSQQFPSCAGGGEGLQQSGEALDGAM^ 
SAGGPCPAAAGGGPGGASCSVGAPGGVSMFRWLBVLEKEFDKAP 
VPVDLLLGEIDPEJQADITYEGRQIOkJTSLSSCPAQLCHKAQSVSQ 
INHKLEAQLVDIiKSELTEa-QASKWLSKEVHDQLLQLHS IQLQL 
HAKTGQSADSGTIKAKLSGPSVEELERBLKAN 


6750 


3 


42B 


SCESRRPGAKl^VWASGALPRDTTGLGSEQPSGDVAQSNRATMGT " 
TAPGP I HLLELCDQKLMBFLCNMDNKDLVWL EE I QE EAERMFTR 

EFSKEPELMPKTPSQKNRRKKRRISYVQDENRDPIRRRLSRRKS 
RSSQLSSRR 


S751 


152 


1417 


PTKATEMAGASVKVAVRVRPFNSREMSRDSKCI t QI^lSGSTTTI V 
NPKQPKETPKSFSFDYSYWSHTSPEDINYASQKQVYRDIGEEML 
QHAFEGYNVCIFAYGQTGAGKSYTMMGKQEKDQQGIIPQLCEDL 
FSRINDrTNDNMSYSVEVSYMElYCBRVRDLLNPKNKGNLRVRE 
HPLLGPYVEDLSKLAVTSYNDIQDLMDSGNKARTVAATWMNETS 
SRSHAVFNIIFTQKRHDAETKflTTEKVSKISLVDLAGSERADST 
GAKGTRLKEGANINKSLTTLGKVISAIiAEMDSGPNKNKKKKKTD 
FIPYRDSVLTWIiLRENLGGNSRTAMVAALSPADINYDETLSTLR 
YADRAKQIRCWAVri^DPKNKLIRELKDEVTRLRDLLYAQGLGD 
ITDMTNALVGMSPSSSLSALSSRNV 


6752 


24 


1834 


RNCVPPLGCYRSRVKPHSDIKMQYSHHCEHLLERLNKQREAGFL ^ 

VKADGPQKIjLEFIYTGTIiNLDSWNVKEIHQAADYLKVEEWTKC 
KIKMEDPAFIANPSSTEISSITGNIELNQQTCLLTLRDYNNRBK 

sevstdliqanpkqgalakkssqtkkkkkafwspktgqnktvqy 
psdilenasvelfldankbptpweqvaqindkseleltswen" 
tfpaqdivhtvtvkrkrgksqpncalkehsmsniasvkspyeae 
nsgeeldqryskakpmcntcgkvfseasslrrhmrihkgvkpyv 
chlcgkaftqcnqlkthvrthtgekpykcelcdkgpaqkcqlvf 

HSRMHHGEEKPYKCDVCNIiQFATSSNLKIHARKHSGSKPYVCDR 

cgqrfaqastltyhvrrhtgekpyvcdtcgkafavssslithsr 

KHTGE KPFI CELCGWS YTD I KNLKKHKTKTHSGADKTLDSSAED 

htlseqdsiqksplsetmdvkpsdmtlpialplgtedhhmllpv 
tdtqsptsdtllrstvngysepqli flqqly 


6753 


2 


1305 


VPSIjPYPPOKWAHTEFTTSSDSETANGIAKPDPVMPGGEBKAS " 
PFGIKLRRTNYSIiRPNCDQQAEQKKKKRHSSTGDSADAGPPAAG 
SARGEKEMSGVALKHGPSLPQERKQAPS trrdsae psssrsvpv 
(UJPGPPPASSQTPAPEHDKAAMKMPLAQKPALAPKPTSQTPPAS 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acia segment containing signal peptide"- 
(A:-Alanine, C«Cysteine, DoAspartic Acid, E= 
Glutamic Acid, F=. Phenylalanine, G=Glircine, 
H-Histidine, I=Isoleucine, K=Lysine, 
I*=Leucine, M=Methionine. N^Asparagine 
P=Proline, Q^Glutamine, R-Arginine, 
S=Serine , T-Threonine , V=Val ine , 
W=Tryptophan, Y=Tyxosine, X=Unknown, *«Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PLsisj.aKPrLVEX^LSRRAGRPDPEPSEPSKh'DQESSDRRPPSPP'- 

GPEERKGQKRDBBEEATERKPASPPLPATQQEKPSQTPEAGRKE 

KPMLQSRHSIJ)GSKLTEKVETAQPLWlTLALOKQKGFREQQATR 

EERKQAREAKQABKLSKEWVSVSVQPGSSSVSRAGSLHKSTALP 

EEKRPETAVSRLERREQLKKANTLPTSVTVEISYSSPAAPLVKE 

VSKRFSSPDDAPVSSEPAWIALAKRKAKAWSDCPLIIK 


O f 3*± 

6755 


2 


413 


tVKRRRRRLGGPEVWTMSSLHKSRIADFOUVJbKSPSIALEKLRE 
LSFSGIPCEGGLRCLCWKI LLNYLPLERASWTS 1 tAKQRELYAQ 
^^^^MIIQPGIAKANMGVSREDVTFEDHPLNPtrPDSRWNTYFKD 


6756 


298 


1343 


PGLPLQVAI.£;adwFLDMPGGRRGPSRCX3LSRSAIiPSLQTLVGGQ ' 
CGWGTGLRNRNGSAIGLPVPPITALITPGPVRHCQIPDLPVDGS 

llpeplffiyllvalpiqyiniyktvwwypynhpasctslnfhl 

IDYHUU^FITVMLARRLVWALISEATKAGAASMIHYMVLISARL 
VLLTLCGWVLCWTLVWLFRSHSVLNLIiFLGYPFGVYVPLCCFHQ 
DSRAHLLLTDYWYWQHEAVEESASTVGGLAKSKDFLSLLLESL 
KEQFWNATPIPTHSCPLSPDLIRNEVECLKADFNHRIKEVLPNS 
LFSAYYVAFIiPLCFVKVSGYLTFMCFLDLCVNYINWVFLV 


6757 


180 


754 


Xfc:KAl*GSLPx.sa I?VSWGSLRTLKYQQQPLUPKV1,LCQTRVQCHD ' 

LRSLQPQPPGLKQSFCLRVLGLQTGATTPGLRD3UTCKELIILTE 

REAQKRKKRKEKESGMALTQGPLTFRDVAIEFSQEEWKSLDPVQ 

KALYWDVMLENYRNLVPLGKDNFALEVKICPRVFLYFLCCLSWE 
PFHYLTETEALLTHK 


D / SO 


2 


459 


NSRVEAPEAH5RESQGSDAMRKHLSWWWLATVCMLLFSHL5AVQ 
TRGIKHRIKWNRKAX,PSTAQITEAQVAENRPGAFIKQGRKLDrD 
FGAEGNRYYEMJYWQPPDGIKYNGCSEANVTKEAFVTGCIMATO 
AANQGBPQKPDNJCLHQQVIiW 


6759 


1 


1008 


AIj^jPKX«PGRRyRDRAPWLPAkLLRGVIAVWVSLSAI.GPGSFCRR " 

RVPSLAQLGHSEAAPSPDDVRWSRVPDRCPEERDRAWPPPPPPS 

LPPSFRRNMANNSPALTGNSQPQHQAT^AAAAQQQQQCGGGGATK 

PAVSGKQGNVLPLWGNEKTMWLNPMILTNILSSPyPKVQLYELK 

rYHEWDEIYFKVTHVEPWEKGSRKTAGQTGMCGGVRGVGTGGI 

VSTAFCLLYKLPTLKLTRKQVMGLITHTDSPYIRALGFWYIRYT 

QPPTDLWDMFESFLDDEEDLDVKAGGGCVMTIGBMLRSFLTKLE 

WPSTLPPRIPVPVQKNIDQQIKTRPRKI 


6760 " 


1 


513 


KKHNFHSLDGl-STRAFHPQTGLPLLSSPVPQRKU-QSGCFDLDSS— 
LLHLKSFSSRSPRPCLNIBDDPDIHEKPFLSSSAPPITSLSLLG 
NFEESVLWYRFDPLGrVDGFTABVGASGAFCPTHLTLPVEVSFY 
SVSDDNAPSPYMGVITLESLGKRGYRVPPSGTIQWCVL 


6761 


239 


606 


VLSKKKGLSAEEKRTRMMbii FSKTKDVFQLKDIJJKIAPKSKGIT ' 

AMS\^VLQSLVDIX3MVIX:ERIGTSNyYWAFPSKALHARKHKLE 

VLESQLSEGSQKHASLQKSIEKAKIGRCETEERT 




29 


1733 

] 

1 

I 

j 
£ 

1 ^ 

£ 


aKTbRGLREVAAJr-SDVADAAVSRRGRCCCCLHCTQTQVAQDCPS"" 
SSSSVORCELSLFO^STjHTMTQWt \mc?/»r»y^7vr\t\n* -M^... «... 
w w u w \^cs.\,a±uuc\ia xtn Lnla a K.u vNS VAGCADDALAGL VACNP 

NLQLIiQGHRVALRSDLDSLKGRVALLSGGGSGHEPAHAGFIGKG 
MLTGVIAGAVFTSPAVGSILAAIRAVAQAGTVGTLLIVKWYTGD 
RLNFGLAREQARAEGIPVEMWIGDDSAFTVr.KKAGRRGr,CGTV 
CIHKVAGALAEAGVGLEEIAKQVNWTKAMGTLGVSLSSCSVPG 
SKPTFELSADEVELGLGIHGEAGVRRIKMATADEIVKLMJCDHMT 
^TTNASHVPVQPGSSWMMVNNLGGLSFLELGIIADATVRSLEG 
^GVKIARALVGTFMSALEMPGISLTLLI.VDEPLIjKL1DAETTAA 
WPNVAAVSITGRKRSRVAPAEPQEAPDSTAAGGSASKRMALVL 

jrvcstllgleehlnaldraagdgdcgtthsraaraiqewlkeg 

»PPASPAQIJ[,SKLSVLLLEKMGGSSGALYGLFLTAAAQPLKAKT 
XPAWSAAMDAGLEAMQKYGKAAPGDRTMLDSLWAAGQEL 
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SEQ 
ID 

"KtfW . 
Vt\J Z 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


Predicted end 
nucleotide 
location 
cor re spond i ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C^Cyeteine, D«Aspartic Acid, E« 
Glutamic Acid, Phenyl alanine, G=Qlycine, 
HtsHistidine, I^Isoleucine, K=Iiysine, 
L=Leucine , M=Methionine , N=^paragine , 
P«Proline, Q=Glut amine, R=»Arglnine, 
S=i Serine, T=Threonine, V= Valine, 
W«Tryptophan, y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


S762 


3 


613 


ASTISWRLCVAGAEARRPVPVAGERAGGGAMWFMYLLSWLSLFI 
QVAFITLAVAAGLYYLAELIEEYTVATSRIIKYMIWFSTAVLIG 
LYVFERFPTSMIGVGLFTNLVYFGLLQTFPFIMLTSPNFILSCG 
LVWNHYLAFQFFABE yy PFSEVLAYFTFCLWr I PFAFFVSLSA 
GENVLPSTMQPGDDWSNYFTKGKRGK 


6763 


2 


760 


SGPDFPGRRFRGCCCVRPPAGAGMELGGHWDMNSAPRLVSETAE 
RKQEQKTGTEAEAADSGAVGARRFLLCLYLGGFLDLFGVSMWP 
LtjSLHVKSLGASPTVAGIVGSSYGILQLFSSTLVGCWSDVVGRR 
SSLIACILLSAIiGYLLLGAATNVFIiFVLAEVPAGIFKHTLSISR 
ALLSDWPEKERPLVIGHFNTASGVGFILGPWGGYJ.TELBDGP 
YLTAF I CFLVFI LNAGLVWFFPRREAKPGSTE 


S764 


80 


438 


LKKMDTMMLSVRNLFEQLVRRVEILS EGNBVQFI QLAKDFEDFR " 
KKWQRTDHBLGKYKDLIiMKAETERSALDVKLKHARNQVDVEIKR 
RQRABADCEKLERQI Qlil REMLMCDTSGSI Q 


6765 


3 


550 


ARYSRVDHFCRRRCRAVARAPRFLLQFPSGPSRHPLAACVARWL " 
RG5 VLVSEAIiSGSAKDGI VTE VAVGVKRGSDELLSGSVLSS PNS 
NMSSMWTANGNDSKKFKGEDKMDGAPSRVLHIRKriPGEVTETE j 
VIALGLPFGKVTNILMLKGKWQAFIiEIATEEAAITNGNYYSAVT ' 
PHLRNQ 1 


^7^6 


1 


1287 


EGGSFKASLTWliWPLGEMKLHCEVEVISRHLPALGIjRNRGKGVR 1 
AVLSLCQQTSRSQPPVRAFLLISTLKDKRGTRYEIiRENIEQFFT | 
KFVDEGKATVRXiKEPPVDICliSKANSSSLKGFIiSAMRIAHRGCN 
VDTPVSTLTPVKTSEFENFKTKMVITSKKDYPLSKNFPYSLEHL 
QTSYCGLVRVDMRMLCLKSLRKIiDLSHNHIKKLPATIGDLIHLQ 
ELNIiNDNHLBSPSVALCHSTLQKSLWSLDLSKNKIKALPVQPCQ 
LQEIiKNLKLDDNELlQPPCKI GQIiINLRFLSAARNKLPPLPSE P 
RtII.SLEYLDLFGK(TFEQPKVLPVIKI.OAPLTLLESSARTILHNR 
I PYGSHI IPFHLCQDUDTAKICVCGRFdtNSFIQGTTTMNUISV 
AHTWIiVDNLGGTEAP 1 1 SY FCSLGCYVNSSDI 


6767 


336 


919 


APMICLCSSDLQFRYKEAFLRDRGLQIGYCSVDDDPRMKHFLNV 
GRLQSDNEYKKDFAKSRSQFHSSTDQPGLUJAKRSQQLASDVHY 
RQPLPQPTCDPEQXiGLRHAQXAHQLQSDVKYKSDLKLTRGVGWT 
PPGSYKVEMARRAAELANARGLGLQGAYRGAEAVEAGDHQSGEV 
NPDATEILHVKKKKALLL 


67^8 . 


2 


363 


PGS TIS C J(n.LSEGSLPLCMQVACGBEKHRAPTMKTLRARFKKTS 
LRLSPTDLGSCPPCQPCPIPKPA/VRGRRQSQDWGKSDERLLQAV 
ENNDAPRVAALIARKGLVPTKLDPEGKSAFHL 


6769 


284 


396 


MSTPDFSTAENNQELANEVSCiaCAMLTLMLQAMGQAD 


6770 


1 


397 


QRHYQVIWSSTMAKLHDYYKDEWKKLMTEFKYNSVMQVPRVEK 
IIliNMGVGEAIADKKLLDtlAAADLAAISGQKPLITlCARKSVAGF 
KIRQGYPrGCKVTLRGERMWEFFERLITIAVPRIRDFRGLSAKS 


6771 


3 


378 


APAGTIiAMTGKSVICDVDRyQAVriAWLLLEEDWKFCADCQSKGPR 
WittjWXi itVrlCIRCAGIHRNLGvHISRVKSVwIjDQWTQE^I QCM 
QSMGNGKANRLYEAYLPETFRRPQIDPYLFWSNLEG 


6772 


1 


1400 


AAAFLQGMTVNGFINTVITS L \ERR YDLHSYQSGLIASS YDI AA 
CLCLTFVSYPGGSG\HKPRWLGWGR\VLMGTGSLVPALPHFTAQ 
P**GWKtjDAGVRTCPANPR\PVCAG\HTSGLSRYQLVFMLGQFL 
HGVGATPLYTIiGVTYLDENVKSSCSPiyiAIFYTAAILGPAAGY 
LIGGALLNIYTEMGRRTELTTESPLWVGAWWVGFLGSGAAAFFT 
A7PILGYPRQLPGSQRYAVNRAAEMHQLKDSSRGEASNPDFGKT 
IRDLPLSIWLLLKNPTFIIiLCLAGATEATLITGMSTFSPKFLES 
QFSLSASEAATLFGYLWPAGGGGTFLGGFPVNKLRLRGSAVIK 
FCLFCTWSLLGILVPSLHCPSVPMTVGVTASYGGSLLPEGHIiNL 
TAPCNAACSCQPEHYSPVCGSDGLMYFSLCHAjGCPAATETNVDG 
QKVYRDCSCIPQNLSSGPGHATAGKCTST 
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SEQ 
ID 
NO: 

6773 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment: containing signal peptide 
(A^Alanine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
ij-ueucine, tvi=iyiecnionine, NsAsparagine, 
P=Proline, Q=Glut amine, R^Arginine, 
S=Serine, T^^Threonine , V=Valine, 
W«Tryptophan, Y-Tyrosine, X=0nknown, *=.Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 




1 


630 


PWEAPKEHKiTKAEEHTWLTVTGEPCHFPFQYHRQLYHKCTHKG 
RPGPQPWCATTPNFDQDQRWGYCLEPKKVKDHCSICHSPCQKGGT 
CVNMPSGPHCLCPQHLTGNHCQKEKCPEPQLLRPPHKNSIWYRT 
EQAAVARCQCKGPDAHCQRLASQACRTNPCLHGGRCLEVEGHRIi 
CHCPVGYTG?PCDVGE*GSGASRRPAPRWDGLAR 


6774 
6775 


146 


389 


liTELSDQQY?LFFILSS/WVPTFLSMDVDGRVIKADSFSKliSS 
_ GLHIGFLTGPKPLIERVILHIQVSTLHPSTFNQLMISQ 


6776 


104 


614 


i CPSQLRVLTARGGRRAP^PQLWTLVLALIEEKWRSHRILRMNS ■" 
GRPETMENLPALYTIFQGEVAMVTDYGAPIKIPGCRKQGLVHRT 
HMSSCRVDKPSEIVDVGDKVWVKLIGREMKNDRIKVSLSMKWN 
. QGTGKDLDPNNV\SLSKKRGGGDPSRITLGRR3PLRLS 


Sill 


3 


1108 


HEaiHERHEGALSQDAI.LRIS I PI/DSKMRPEKCRRF VHPQWQLIjH 
LNGTFPKTSDADMEPCVDGWVYDRISFSSTIVTEWDLVCDSQSL 
TSVAKFVFMAGMMVGGXLGGHLSDRFGRRFVLRWCYLQVAIVGT 
CAAliAPTFLIYCSLRFLSGIAAMSLlTNTIMLIAEWATHRPOAM 
GITLGMCPSGrAF^4TLAGLAFAIRDWHILQLWSVpyFVIPLTS 
SWLLESARWLIr^aKKPEEGLKELRKAAHRSGMKNARDTLTLEIL 
KSTMXKELEAAQKKKPPLGERIiHMPNICKRISLLPFTKPANFyA 
YPGLNIiHG/LKHLGNNVFLLQTLPGAV/TPPGQLVLHLGHWGSG 
RVSSRGRVKCLGIiFVLQVW 


6118 


119 


63 


CFFHGPAWRutzSVRATFAKKQGQSGIISCIAFSPAQPLYACGSy 

GRSLGLyAWDDGSPLALLGGHQGGITHLCFHPDGHRFFSGARKO 

AELLCWDLRQSGYPLWSLGREVTTNORIYFDliDPTCQFLVSGST 

SGAVSVWDTDGPGNDGKPEPVLSPLPQKDCTNGVSLHPSLPLLG 

HCLPVSVCPLSPTESGGRRRGAGPSLGSPRRHVHLECRLQLWWC 
CGGARLQHP* *SPRARKGR 




311 


805 


i«sitdesrgsirrknpantrlrlnvp\ebtagdse/erspeee 
vqadprirsaspkcptsspfpkgrspbgegetNdpekvhfhpgp 
kdksvaekn\kgp\spvssegikdfpsmkpewenlnqsnvrrmh 
t\avrlnevivkksrdaklvllnmpgpprnrkgdeny 


6779 
6780 


2 


53 S 


RAIJlRQPRLlJU^GIEPESMAISBPrKGSKKPCVNKEErJVLKKP 

makcawkgpreppqdaraeaespggasesdqdgghesppkkkav 

AWVSAKNPAPMRKKKKVSLGPVSYVLVDSEDGRKKPVMPKKGPG 

srreasdqkaprgqqpaeatastsrgpkakpegsprratnesrk 


6781 


3 


403 


HhVNDNKPBij>riNrjyrSPGKEEISYI?EGDPIDTFVALVRVQDKD 

sglngeivcklhghghfklqktyennyliltnatldrekrseys 
ltviaedrgtpslstvkhptvqindindnpphfqrsryefvxse 


6782 


1 


1269 


APTRPVFPTLyDIiSSSKEPSNSLNLPHSNELCSSLVHPELSEVS 
SNVAPSIPPVMSRPVSSSSISTPLPPNQITVFVTSNPITTSANT 
SAALPTHLQSAI/MST\ArrMPNAGSKVMVSBGQSAAQSNARPQFI 
TPVFINSSSIIQVMKPSQPSTIPAAPLrTNSGLMPPSVAWGPL 
HIPQNIKFSSAPVPPNALSSSPAPNIQTGRPLVLSSRATPVQLP 
SPPCTSSPWPSHPPVQQVKELNPDEASPQVNTSADQWTLPSSQ 
STTMVSPLLTNSPQSSGNRRSPVSSSKGKGKVDKIGQILLTKAC 
KKVTGSLBKGEEQYGADGETEGQGLDTTAPGLMGTEQLSTELDS 
KTPTPPAPTLLKMTSSPVGPGTASAGPSLPGGALPTSVRSIVTT 
LVPSELI SAVPTTKSWHGG lASESLAG 




3 


1327 

1 

1 
I 
1 


idK^TViRIPAJU^KCIiHEDPyifPPPLPAEKPIGNTFSTVSGKLS 
flVERTRNLESNHPGCSTGGFVRVPPRLPPRpVNGKTIPTQQPPTK 
^PERPPPPKLSATRRSNKKr.PFmSSSDMDLQKKQSNIiATGLS 
CAKSQVPKNQDPVLPPRPKPGHPLYSKYMIiSVPHGrANEDIVSQ 
^PGELSCKRGDVLVMLKQTENWYLRCQKGEDTGRVHLSQMKLIT 
'riDEHLRSRPWPFSPPKAPSHaQKPVDSGAPHAWLHDFPAEQV 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal pepTTHT- 
(A-Alanxne, C=Cy3teine, D-Aspartic Acid, E= 
vFAu^anij.c /icia, r-ir'nenyl alanine, G=Glycine, 
H=sHistidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine 
P=Proline, Q-Glutamine, R=:.Arginine , 
S=Serine, T^Threonine, V=Valine, 
W»Tryptophan, Y-Tyrosine, X^-Unkno^ra, *^Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6783 






DDLNI,TSGi!;ivyi.ljEKIDTl)W5fRGKCRNQrGIFPANYVKVIIDI 
PEGGNGKRECVSSHCVKGSRCVARFEYIGEQKDELSFSEGEIII 
LKEYVNEEWARGETOGRTGIPPUUPVEPVEDYPTSGANVLSTKV 
PLKTKKEDSGSNSQVNSLPAEWCEALHSPTAETSDDLSFKRGDR 


6784 


3 


1750 


SYHHHHAggsAAAS PyfLTASQKTVTTTSMITTKTIjPLVIiKAATA 
xt irriovv\j\2K.if X J-ANVTAINSQKAVliSTDVQNTPVNLQTSSKVT 
GPGAEAVQIVAKWTVTLQVQATPPQPIKVPQFIPPPRLTPRPWP 
LPQVRPKPVAQWNIPIAPAPPPMLAAPQLrQRPVMLTKFTPTTL 
PTSQNSIHPVRWITGQTATIAKTFPMAQLTSIVIATPGTRIAGP 
QTVQLSKPSLEKQTVKSHTETDEKQTESRTITPPAAPKPKREBN 
PQKLAPMVSLGLVTHDHLEEIQSKRQERKRRTTANPVYSGAVPE 
PERKKSAVTYLNSTMHPGTRKRGRPPKYNAVLGFGALTPTSPQS 
SHPDSPENEKTETTFTFPAPVQPVSIaPSPTSTDGDIHEDFCSVC 
RKSGQltLWCDTCSRVYHLDCLDPPLKTrPKGMWICPRCQDQMLK 
KEEAIPWPGTLAIVHSYIAYKAAKEEEKQKLLKWSSDLKQEREQ 
LEQKVKQLSNSISKCMEMKNTILARQKEMHSSLEKVKQLIRLIH 
GIDLSKPVDSEATVGAISNGPDCTPPANAATSTPAPSPSSQSCT 


678S 


3 


1750 


SyHHHHAQQSAAA3PNLTASQKTVTTTSMITTKTLPLVLKAATA~ 

TMPASWGQRPTIAMVTAINSQKAVLSTDVQNTPVNLOTSSKVT 

GPGAEAVQrVAKNTVTLQVQATPPQPIKVPQFIPPPRLTPRPNF 

LPQVRPKPVAQNNIPIAPAPPPMLAAPQLIQRPVMLTKFTPTTL 

PTSQWSrHPVRWNGQTATIAKTFPMAQLTSIVIATPGTRLAOP 

QTVQLSKPSLEKQTVKSHTETDEKQTESRTITPPAAPKPKREEN 

PQKIAFMVSLGLVTHDHLEEIQSKRQERKRRTTANPVYSGAVPE 

PERKKSAVTYLNSTMHPGTRKRGRPPKYNAVIiQFGALTPTSPQS 

SHPDSPEKTBKTBTTPTFPAPVQPVSLPSPTSTDGDIHEDFCSVC 

RKSGQLLMCDTCSRVYHLDCLDPPLKTIPKGMWICPRCQDQMLK 

KEEAIPWPGTLAIVHSYIAYKAAKEEEKQKLLKWSSDLKQEREQ 

LEQKVKQLSNSISKCMEMKNTILARQKEMHSSLEKVKQLIRLIH 

GIDLSICPVDSEATVGAISNGPDCTPPAKAATSTPAPSPSSOSCT 

AWCNQGEETK 


6786 


1 


528 


LGNTVLHYCtiMySKPECLKLLLRSKPTVDIVNQAGETALDIAKR 
LKATQCEDLLSQAKSGKFNPHVHVEYEWNLRQEEIDESDDDLDD 
KPSPVKKERSPRPQSFCHSSSISPQDKLALPGFSTPRDKQRLSY 
GAFTNQIFVSTSTDSPTSPTTEAPPI^PPRirAGKGPTGPPITPHR 


6787 


X820 


1397 


RSPKVLVIiAf TREUUJHVSRUFKPI \TRKLTVARFYGGTSYQSQ 
J. i i^'taXllUJHJjQSGRLDLSKLRirvVLDEVDQML 

dlgpaeqvediihesyktdsednpqtllfsatcpqwvytva\kk 

YMKSRYEQVDIJDGKMTQKAATTVEHLAIQCHWSQRPAVIGDVLQ 
» * xir L«a 1 iyu^ V i bMAMNPH IKQNAQCLHGDIAQSQR 
EITLKGFREGSPKVIiVATNVAARGLDIPEVDDVIQSSPPQDVES 
YIHRSGRTGRAGRTGICICFYQPRERGQLRYVEQKAGITFKRVG 
VPSTMDLVKSKSMDAIRSIiASVSYAAVDFFRPSAQRLISEKGAV 
DALAAALAHISGASSPEPRSLITSDKGPVTMTLSSLEEIQDVSC 
AWKELMRKLSSNAVSQITRMCLLKGNMGVCFDVPTTESERLQAE 
WHDSDWILSVPAKLPErEEYYDGMTSSNSRQRSGWSSGRSGRSG 
RSGGRSGGRSGRQSRQ6SRSGSRQDQRRRSGNRNRSRSGGHKRS 

PD*VFYHI,VDFLSDPLVDSVYliTGRQIDHl.TGt,TGLIDHLTSHS 
SVWN 


6788 


2646 


2270 

] 
] 


PSSFPKNVPLfa'EHiEPPK*KRSGLGSLTPKSQIQNGP*PQTFPF 
FELGSPSGVISAHCNLRLLGSSDSPAPASRVAGIIGTCHHAWLI 
LiVFLVEMGFHHVGQAGLKLLTLXVIHPPWPPICVLGLQT 




16 


936 c 
I 


5GTVDLR\DMIAVSVIiAAVRGGR/ATVRRVRESKrVLHBkSKGKT 
IEGAEDKMTSGDVLSNRKMFYLLKTAFPSVQINTEEHVD\ELDQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corrBspondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

nucleotide 

location 

c orre spending 

to first 

amino acid 

residue of 

amino acid 

sequence 


Amino acid segmenc containing signal pepti'3^ 
(A«Alanine, C-Cysteine, D«.Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G-Glycine. 
H«Histidine, I«Isoleucine, K=Lysine, 
tj=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q^Glutamine, R=Arginitie, 
S=.Serine, T=.Threonine, V^Valine, 
W=Tryptophan, Y-Tyrosine, X=UnJcnown, *-stop 
Codon, /^possible nucleotide deletion, 
\t=possible nucleotide insertion) 


6789 






EVILWGS^DS*GYPKGK*LLPKSVPSR/RVLLSG~LfPLDATQE\ 
FTEDLSK\YVTTMVCVAVNGKPMLGVIHKPFSEYTAWAMVDGGS 
NVKARSSYNEKTPRIWSRSHSGMVKQVALQTFGNQTTIIPAGG 
AGYKVIALLDVPDKSQEKADLYIHVTYIKKWDICAGNAILKALG 
GHMTTLSGEEIS YTGSDGIEGGLLAS IRMNHQALVRKLPDLE KT 
GHK 


6790 


2 


678 


GNGINVLKlAPESArKFMAYEQIKRX,VW**PGDS*GF/Y£RLVA 
GSIiAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARRILARE 
GVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNSAD 
PGVFVLLACGTMSSTCGQLASYPLALVRTRMQAQASIEGAPEVT 
^^^^^rLRTEGAFGLYRGLAPNFMKVIPAVSISYWYENLKr 


6791 


2 


4068 


APPAGRRRMQAAPRAQCXSAALLLWIVSSCLCRAWtAPSTSQKCD 

EPLVSGLPHVAFSSSSSISGSYSPGYAKINKRGGAGGWSPSDSD 

HYQWLQVDFGNRKQISAIATQGRYSSSDWVTQYRMLYSDTGRNW 

KPYHQDGNIWAFPGNINSDGWRHELQHPIIARYVRIVPLDWNG 

EGRIGLRIEVYGCSYWADVlNFDGHWLPYRPRNKKMKTLKDVr 

ALNFKrSESBGVlLHGSGQQGDYITLELKKAKLVLSLNLGSNQL 

GPIYGHTSVMTGSLLDDHHWHSWIERQGRSINLTLDRSMQHFR 

TNGEPDYLDIiDYEITFGGIPFSQKPSSSSRKNFKGCMESINYNG 

VNITDLARRKKXiEPSNVGNLSPSCVEPYTVPVFFWATSYLBVPG 

RUJQDLFSVSFQFRTWNPNGLLVFSHFADNLGNVEIDLTESKVG 

VHINITQTKMSQIDISSGSGLNDGQWHEVRFLAKENFAILTIDG 

DEASAVRraSPLQVKTGEKYFFGGFLNQMNNSSHSVLQPSFQGC 

MQLIQVDDQLVNLYEVAQRKPGSFANVSIDMCAIIDRCVPNHCE 

HGGKCSQTWDSPKCTCDKTGYSGATCHNSIYEPSCEAYKHLGQT 

SNTYYWIDPDGSGPLGPLKVYCNMTEDKVWTIVSHDLQMQTPVVG 

YNPEKYSVTQLVYSASMDQISAITDSAEYCEQYVSYFCKMSRLL 

NTPDGSPYTWWVGKANEKHYYWGGSGPGIQKCACGIERNCTDPK 

YYCNCDADYKQWRKnAGFLSYKDHLPVSQVWGDTDRQGSEAKL 

S VGPLRCQGDRNYWNAASFPNPS S YLHFSTFQGETSADI SFYPK 

TLTPWGVFLENMGKEDFIKLBLKSATEVSFSFDVGNGPVEIWR 

SPTPLNDDQWHRVTAERNVKQASLQVDRLPQQIRKAPTEGHTRL 

ELYSQLFVGGAGGQQGFLGCIRSLRMNGVTLDLEBRAKVTSGFI 

SGCSGHCTSYGTNCEWGGKCLERYHGYSCDGSNTAYDGTFCNKD 

VGAPFBEGMWLRYNPQAPATOARDSSSRVDNAPDQQNSHPDLAQ 

EEIRFSFSTTKAPCILLYISSFTTDFLAVLVKPTCSLQIRYNLG 

GTREPYWIDVDHRrmANGQPHSVWirRHBKTIFLKLDHYPSVSY 

HLPSSSDTLFWSPKSLFIjGKVIETGKIDQEIHKYNTPGFTGCLS 

RVQFNQIAPLKAALRQTWASAHVHIQGELVESNCOASPLTLSPM 

ssatdpwhldhldsasadfpywpgqgqairngvnrnsai iggvi 

A\W1PTPSLCTP\VLP*SR*HVSPHKGTLPIPNEAKGAGSRQK 
KPGRRPSMNWDPPTSQRPIDESKKEWPHLRGGYLAMG 


6792 


1801 


1193 


TGHEGAKGEKt7UKGDLGPRGERGQHGPKGBKGYPGIPPSL/PGW"' 
SAW*SWLTAASTKVOAIljljPnPTiP*r/3mTat7M»c¥ «'t*t*»o*-(^ 

nsgiifssvetnignfpdvmtgrfgapvsgvyfftfsmmkhedv 

EBVYVYLMHNGNTVPSMYSYEMKGKSDTSSNHAVI.KLAKGDEVW 

lrmgkgalhgdhqrfstfagfll fetk 




33 


1073 

( 
i 
2 


VRHTNWGVDWrJUFSLGSESPKGAIQHiVSTEKTILAVERNiCVLL 

pplwwrtfswgpddfscclgsygsbkvlmtfeniaawgrclcav 
cpspttivtsgtstwcvwelsmtkgrprglrlrqalyghtqav 
rciiaasvtfsllvsgsqdcrcilwdldhlthvtrlpahregisa 

rXISDVSGTIVSCAGAHLSLWNVNGQPIASITTAWGPEGAITCC 

:lmegpawdtsqiiitgsqdgmvrvwkt/vgcedvcswtasrrg 

^PGSASKPKRPQVGEEPGI,ESRAGR*HCPDREAQQNQP\PVTAL 
\VSRNHTKLLVGDERGRIPCWSADG*'EERGSRGSGTTVPG 
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SEQ 
ID 
NO: 



Predicted 
beginning 
nucleotide 
location 
cortespondirxg 
to first 
amino acid 
residue of 
amino acid 
sequence 



Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
eec[uence 



Amino acid segment containing signal peptide 
<A==Alanine, C==Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H«Histidine, 3=Isoleucine, K=Lysine^ 
L-Leucine, M=Methionine, N=Asparagine, 
P= Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, Vs Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *«Stop 
Codon, /^possible nucleotide deletion, 
^t=po5sible nucleotide insertion) 



6793 



2340 



805 



GRKEANYNYGSLTQAGTVSLGLDAKGQEVFVPFSAVLPMVAPND 
LVFDGWDISSLiNLT^EAMRRAKVLDWGLQEQLWPHMEALRPRPSV 
YIPEFIAANQSARADNLIPGSRAQQLEQIRRDIRDFRSSAGLDK 
VIVLWTANTERFCEVIPGLI3DTAENLLRTIELGLEVSPSTLFAV 
ASILEGCAFLWGSPQNTLVPGALELAWQHRVFVGGDDFKSGQTK 
VKSVLVDFLIGSGLKTMSIVSYNHLGNNDGENLSAPLQFRSKEV 
SKSNWDDMVQSNPVLYTPGEEPDHCWI KYVP YVGDS KRALDE 
YTS BLMLGGTNTLVLHNTCBDSLLAAPIMLDLAliLTELCQRVS F 
CTDMDPEPQTFHPVLSLIiSFtjFKAPIiVPPGSPWNALFRQRSCI 
EN I LRACVGLPPQNHMLLEHKMERPGPSLKRVGP VAATYPMIiNK 
KGPVPAATNGCTGDANGHLQEEPPMPTT*GPGHTVSRLPLPAAP 
HDPTLKAPTNKGRCHFSPPSTWGSWGL 



6794 



169 



1349 



679S 



1010 



DDVKRKPEASAH*EKPGPPSRPGVRGGRERAGGRGSHGARSCR\ 
BPAP PAPAP PBDHPDEEMGFTIDI KS FLKPGEKTYTQRCRLFVG 
NLPTDITEEDFKRLPERYGEPSEVFINRDRGFGFIRLESRTLAB 
lAKAELDGTILKSRPLRIRFATHGAALTVKNLSPWSNELLEQA 
FSQFGPVEKAWWDDRGRATGKGFVEFAAKPPARKALERCGDG 
AFLIiTTTPRPVIVEPMEQFDDEDGLPEKLMQKTQQYHKEREQPP 
RPAQPGTPE PBYASRWKALDEMEKQQREQVDRNI REAKEKLEAE 
MEAARHEHQLMLMRQDLMRRQEEUtRLEELRNQBLQKRKQIQLR 
HEEEHRRREBEM IRHRSQEEIiRRQQEGFKPNYMEN YVCHFLR 



6796 



GPRROTQ VRDI IEI,DSF*DWAAQETDCAQNSGERL * KGV/ LENFS 
TMSKSAVKISLDLLSNPLCEQDQDiajNMVTALDTAMKRMDAFNQ 
BKVNQIQKTVIEPLKKFGSVPPSLNMAVKRREQALQDYRRLQAK 
VEKYEEKSKTGPVLAKLHQAREELRPVREDFEAKNRQLLEEMPR 
FYGSRLDYFQPS FESLIRAQWYYSEMHKIFGDLS HQLDQPGHS 
DEQRERENEAKLSELRALS I VADD 



683 



GKEIQIPTIKIiAWLXiPGLE* PVGALGKGWSF* * SHVALGQIjGW 
LTRAVRSSWRWELCVSAQEWSQRSA*SSPSPVGACPSLNPPET 
S VQEGRDCWQR* bPRLFSAIiVGQPGCWPQGAPPERCV* PGRCKW 
HIjQSQVLR*ERRRCCRCLPRFA*GWRRRHQRLGLGIHPAPL6ST 
5PPHPEGNSQQCRR * GWAAELRLPSS WL*GKLGC* 



6797 



1620 



211 



6798 



3894 



1696 



TERMTPSQPTRGSSCTRPSSMLWrSTWRCLTCHWAGMRMSWGV 
TLGPMAQGIiLSASGTTTEATWTRPTTHLTLIRWWLLTASRVDPP 
ERPPPPPSDDLTLLESSSSYKNL/DAQIPQ/DWSMSPSTSG*RP 
LTSRASSIMRSRTAIPSAS*SRI.TTKHTVGGSPSAWRPRPTSRS 
VSTPVSSSTETTASGSCLTWWSSSPAPCPSSSAPAHSFEASCCK 
TSLWGS CGGSGDGSS ACGSGWNLSMAGTS CSS PAMCSPSRAPS * 
RSASRPRTWRATTSAASSWAPRRCWCGWA*SAT*PSSTTTISSS 
PHCGWPCPASCASAAAWLSSTWATASVAGSCWGPIM*SSAHSPW 
CLSACSRSSMGTTCL*RSPP\SGASRAAAAWCGSSPSSTPTPSS 
ASSSTWCSASSSRSSPAPTTPSSIPAAQAQRRASCRPTSHSART 
APPPAS SAAGAARPAAFSTU^GTPRRS I RCW 



STISWESLESWLNKATNPSWRQEDWEYIIGFCDQINKELEG*VS" 

ALWGQLRGS6LGRGTTMAKEGQPGSPRLSALECVLLVPQ\PQIA 

VRLLAHKIQSPQEWEALQALTYLGDRVSEKVKTKVIELIiYSWTM 

ALPEEAKI KDAYHMLKRQGI VQSDPPIPVDRTLIPS P PPRPKNP 

VFDDEEKSKLLAKLLKSKNPDDLQEANKLIKSMVREDEARIQKV 

TKRLHTLEEVNNKVRLLSEMLIiHYSQEDSSDGDREIjMKSLFDQC 

ENKRRTLFKLASETEDNDNSLG0ILQASDNLSRVI NS YKT IIEG 

CVINGEVATLTriPDSEGNSQCSNQGTLIDLAELDTTNSIiSSVIiA 

PAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATLGPSSTSNAL 

SWLDEELLCLGIiADPAPNVPPKESAGNSQWHLLQREQSDLDFFS 

PRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPPPAPWPASVP 

APSAGSSLPSTGVAPAIAPK7BPAVPGHHGLALGNSALHHLDAL 

DQIiLEEAKVTSGLVKPTTS PLI PTTTPARPLLPFSTGPQS PLFQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
setjuence 


Amino acxd segmenc containing signal pepti<Ji 
(AssAlanine, C»Cvsteine C>-a«;na-r*--i r» &.^<;r3 ^ 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine^ 
L=Leucine, M^Methionine, N^Asparagine, 
P=Proline, Q^Glutamine, R=Arginine, 
S=Serine, T^Threonine, V==Valine, 
W«Tryptophan, Y^Tyrosine, X=Unknown, *=.Stop 
Codon, /^possible nucleotide deletion, 
\-possible nucleotide insertion) 








PLSFQSQGSPPKGPELSIiASIHVPIaESIKPSSALPVTAYDKNGF 
RILFHFAKECPPGRPDVLWWSMLNTAPLPVKSIVLQAAVPKS 
MXVKLQPPSGTELSPPSPIQPPAAlTQVMLLANPLKEKVRLRyK 
LTPALGEQLSTEVGEVDQFPPVEQWGNL 


6799 
6800 


3894 


1696 


STISWESLESWLNKATNPSNRQEDWEyiIGFCDQINKELEG*VS 

ALWGQLRGSGLGRGTTMAKEGQPGSPRLSALECVLLVPQ\PQIA 

VRLIAHKIQSPQEWEALQALTYLGDRVSEKVKTKVIELLYSWlVl 

ALPEEAKIKDAYHMLKRQGIVQSDPPIPVDRTLIPSPPPRPKNP 

V r iJA-f-o^iit^ aJjIj/uUjJjKSKNPDDLQEANKL IKSMVREDEAR 

TKRLHTLEEVNNNVRLLSEMLLHYSQEDSSDGDRELMKELFDQC 

ENKRRTLPKXASETEDNDNSLGDILQASDKLSRVINSYKTllEG 

QVINGBVATLTLPDSEGNSQCSWQGTI/IBIABLDTTNSLSSVIA 

PAPTPPSSGIPILPPPPQASGPPRSRSSSQAEATliGPSSTSNAIi 

SWLDEBLLCLGLADPAPNVPPKESAGNSQWHLLQREQSDtiDFPS 

PRPGTAACGASDAPLLQPSAPSSSSSQAPLPPPFPAPWPASVP 

APSAGSSLFSTGVAPALAPKVEPAVPGHHGIALGNSALHHLDAL 

DQIiLEEAKVTSGLVKPTTSPLIPTTTPARPLLPFSTGPGSPLPQ 

PLSPQSQGSPPKGPELSLRSIHVPLESIKPSSALPVTAYDKNGF 

RlIiPHPAICECPPf^P^rmT.tnTm/OMT M»r7iT>T n'^rt^e^-r^rr mkiLTm*.. 
n*. **^^.c^^„e'ir\^Kl:'JJVJJV V Vvor'JjjW lAfliFVKSl VliQAAVPKS 

MKVKLQPPSGTELSPFSPIQPPAAITQVMLLAI^PLKEKVRLRYK 
IiTPALGEQLSTSVGEVDQFPPVEQWGNL 




404 


1646 


RRSPSTGLSPVPQPSSPSJbSDySiPWSLLI,SGTIAWATPGK*AG 
*PQAW*LGLAPAIAFI/GLTRGRKQNKEKMAEGGSGDVDDAGDC 
SGARYNDWSDDDDDSNESKSIVWyppWARIGTEAGTRARARARA 
RATRARRAVQKRASPNSDDTVLSPQELQKVLCLVEMSEKPYH^ 
AALIALGNNAAYAPNRDIIRDLGGLPIVAKIUNTRDPIVKEKAL 
IVLNNLSVNAENQRRLKVYMNQVCDDTITSRLNSSVQLAGLRLXi 
TNMWTNEYQHMLANSlSDPPRLPSAGNEETKLQVLKLIiriNLAB 
KrPAMTREIiLRAQVPSSLGNSLFWKKBNKEVILKLLVIPENINDN 
FKWHENEPTQJIOFGEGSLFFFLKEFQVCADKVLGIESHHDFLVK 
VKVGKPMAKLAEHMPPKSQK 


6801 


2 


1755 


SAEEFESQQASVTMHDVDAESFEVLttoYCYl'^RVSLSEANVERL 
lAAia LwnjjQLiEYVRKACAS FLARRLDLTNCTAILKFADAFGHRKL 
RSQAQSYIAQNFKQLSHMGSIREETLADLTIiAQLtAVLRLDSLD 
VESEQTVCHVAVQWliEAAPKERGPSAAEVFKCVRWMHFTEEDQD 
YLEGIiLTKPIVECKYCLDVIEGALQMRYGDLLYKSLVPVPNSSSS 
/R*QQQLSCrCSRKSTPBTGYVCQGDGDLLWTPQRSLS\RYDPY 
SGDIYTMPSPLTSFAHTKTVTSSAVCVSPDHDXYIAAQPRKDLW 
VYKPAQNSWQQLADRLLCREGMDVAYIiNGyiyiLGGRDPrTCVK 
LKEVECYSVQRMQWALVAPVPHSFYSFELIWQNYLYAVNSKRM 
LCYDPSHNMWLNCASLKRSDFQEACVFNDEIYCrCDIPVMKVYN 
PARGEWRRISNIPLDSETHNYQIVKHDQKLLLITSTTPQWKKNR 
VTVYEYDTREDQWINIGTMLGLLQPDSGFICLCARVYPSCLEPG 
QSFITEEDDARSBSSTEWDLDGFSELDSESGSSSSFSDDEVWVQ 
VAPQRNAQDQQGSL 


6B02 
6803 


157 
1 


1341 

2203 ] 


liTFPLFFFLLSKTPGKTASMAHFVQGTSRMIAAESSTEHKECAE 
PSTRKNLMITSIiEQKIRCIiEKQRKEI,LEVKQQWDQQPRSMKELYE 
RKVAELKTKLDAAERFLSTREKDPHQRQRKDDRQREDDRQRDLT 
RDRLOREEKEKERLNEELHELKEENKLLKGKWTLANKEKEHYEC 
EIKRLNKALQDTU^IKCSFSEDCLRKSRVBFCHEEMRTEMEVLK 
QQVQIYEEDFKKERSDRERLNQBKEELQQINETSQSQLNRLNSQ 
IKACQMEKEKLBKQLKQMYCPPCNCGLVFHLQDPWVPTGPGAVQ 
KQREHPPDYQWYALDQLPPDVQHKAN/DWCIlAPPPVCCQAG/PR 
rPGLK*SSCLWLPKC*NPRFIZ.SKESPSVEVHTNRERQQATRER 

KLSGRPYRHMQA/LGTSKLYDIRKTIFTFTPQFIDCXSQFYIALDN 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=»Alanine, CsCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G-Glycine, 
H=Histidine, I^alsoleucine, K=Lysine, 
LaLeucine , M=Methianine , N=Asparagine / 
P=Proline, Q=Glut amine, R^Arginine, 
S=Serine, T=Threonine, V=>Valine, 
W=.Tryptophan, Y=Tyrosine, X=UnJcnown, *=stop 
Codon, /^possible nucleotide deletion, 
\=pos3ible nucleotide insertion) 








KMIVEMLRTDLSYLCSRWR>aTGQPTITFPISHSMLDEDGTSLNS 
S I LiAALRKMQDGYFGGARVQTGKIjSEFLTTSCCTHLS FMDPGPE 
GKLYSKDYDDNYDYLESGNWMNDYDSTSHARCGDEVARYLDHLL 
An i AfHV xtLiAf X bUlCtaCvLiUKr QAAVQTTCDLMSLVTKAKELHVQ 
NVHMYIiPTKLFQASRPS FNLLDS PHPRQENQVPSVRVE IHLPRD 
QSGEVDFKALVLQLKETSSLQEQADrLYMIiYTMKGPDWNTELYW 
ERSATVRELLTELYGKVGEIRHWGHRYISGILRKKVEALDEAC 
TDLLSHQKHIiTVGLPPEPREKTISAPLPYEALTQLIDEASEGDM 
SISILTQEIMVYLAMyMRTQPGLFAEMFRLRIGLIIQVMATELA 
HSLRCSAEEATEGLMNLSPSAMKNLLHEILSGKEFGVERK/SVR 
PTDS NVSPAI SIHEIGAVGATKTERTG IMQI»KSBIKQVE FRRLS 
ISAESQSPGTSMTPSSGSPPSAYDQQSSKDSRQGQWQRRRRIiDG 
ALNRVPVGFYQKVWKVLQKCHGLSVEGFVLPSSTTREMTPGE I K 
PSVHVES\VLNVLIiRPEYRQI*I»VEAILVLTMLADIElHSIGSII 
AVEKIVHlANDIjFi:jQEQKTLGP\DDTMIAKDPASG\lCTLR\yD 
SAPSGRFGTMTYLS \RAA\ATYVQEFLP\HS I CAMQ 


6804 


1 


9S1 


GSPGKKEEKAKNKESLCMENSSNSSSDEDEEETKAKMTPTKKYN 
GLESKRKSLRTTGFYSGFSEVAEKRrKLLNNSDERLQNSRAKDR 
KDVWSS IQGQWPKKTLKELFSDSDTEAAAS P PHPAPEEG VAEES 
LQTVAEEESCSPSVBLEKPPPVNVDSKPIEEKTVEVKDRKAEFP 
S SGSNFS A* I PLPYLHLNRLHQSL* QKGSRQQSS VTVSEPLAPN 
QBE VRS IKSET0STIEV0S VAGEIjQDLQSERE *IiASRF * CQCEI* 
KQ * * SARTRTS *KSLYRSEKSERCSGRRKF1 KKAEKKP * SNSGK 
QQKEGKRHK 


6805 


1539 


206 


RQPDLKYFGKS PDVS VSESSSLX.SNDLPKFADG I KARNRNQNYIi 
VPS P VLRILDHTAFSTEKSADIVICDEECDS PES VNQQTQEESP 
I EVHTABDVP lAVE VHAISEDYDI ETENNSSESLQDQTDEEPPA 
KLCKILDKSQAIiKVTAQQKWPLIiRANSSGLYKCELCEFNSKYFS 
DLKQHMILKHKRTDSNVCRVCKESFSTNMLL I EHAKLHEEDPY I 
CKYCDYKTVIFENLSQHIADTHFSDHrjYWCEQCDVQFSSSSELY 
LHFQBHSCDEQYLCQFCEHBTNDPEDLHSHWNEHACKLIELSD 
KYNNGEHGQYSLLSKITFDKCKNPFVCQVCJGFRSRLHTNVNRHV 
AIEHTKI PPHVCDDCGKGFSSMLE \ lAKHLNSHLSEG I YLCQYW 
EYSTGQIEDLKIHLDFKHSADLPHKCSDCLMRFGNERBLISHLP 
VHETT 


6806 


272 


3794 


VALCFPNSDPVMFMDAFYGCbLAELGPVPIEVPLTRKDAGSQQV 
GFIiLGSCGVFIiArjTTDACQKGr/PICAQTGEVAAFKGWPPIiSWLVI 
DGKHIiAKPPKDWHPLAQDTG'PGTAYI E YKTS KEGSTVGVTVSHA 
SLLAQCRALTQACGYSEAETLTNVLDFKRDAGLWHGVLTSVMNR 
MtrVVSVPYALMKANPLSWIQKVCFYKARAALvksRDMHWSLrAQ 
RGQRDVSLSSLRMLIVADGANPWSISSCDAPLNVPQSRGLRPEV 
ICPCASSPEALTVAIRRPPDLGGPPPRKAVLSMNTGLSYGVIRVD 
TEEKLSVLTVQDVGQVMPGANVCWKIiEGTPYLCKTDEVGEICV 

FIGPDHLVP 1 VGKLDGLMVTGVRRHNADDWATALAVEPMKFVY 
RGRIAVFSVTVLHDDRIVLVAEQRPDASEEDSFQWMSRVLQAID 
S IHQVGVY CliALVPANTLP KAPLGG I HI SETKQRPXjEGTLHPCN 
VLMCPHTCVTNLPKPRQKQPBVGPASMIVGNLVAGKRIAQASGR 
ELAHLEDSDQARKFLFLADVLQWRAHTTPDHPLFLIjLNAKGTVT 
STATCVQLHKRAERVAAALMEKGRLSVGDHVALVYPPGVDIiIAA 
FYGCLYCGCVPVTVRPPHPQWJjGTTLPTVKMIVEVSKSACVLTT 
QAVTRIiLRSKEAAAAVDIRTWPTILDTDDIPKKKIASVFRPPSP 
DVLAYLDFSVSTTGILAGVKMSHAATSALCRSIKLQCELYPSRQ 

iaicldpygglgfalwclcsvysghqsvlvpplelesnvslwls 
avsqykar\n?fccysvmemctkg3jgaqtgvlrmkgvltlscvrtc 
mwaberp\rialtqsfsklfkdlglparavsttpgcrvnvaic 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine, K=Lysine, 
L=rjeucine, M»Methionine, N^^Asparagine , 
P=*Proline, Q^=Glutamine , R^Arginine, 
S=Serine, T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 








LQGTAGPDPTTVYVDMRALRHDRVRLVERGSPHSLPLMESGKIL 
PGVKVIIAHTETKGPLGDSHLGEIWVSSPHNATGYYTVYGRBAL 
HADHFSARLSFGDTQTIWARTGYLGFLRRTELTDASGGRHDALY 
WGSLDETLELRGMRYHPIDIETSVIRAHRSIAECAVPTWTNLL 
WWELDGLEQDAIjDLVALVTNWLBJBHYL WGVWI VDPGVI p 
INSRGEKQRMHLRDGFLADQLDPIYVAYNM 


6807 
6608 


1444 


606 


VGHDTVHAwrrCFPKCLGFSPPVNVTVSPRSEESHTTTVSGGNG 
. SVFQAGPQLQAIiANLEARRGSlGAALSSRDVSGLPVYAQSGEPR 
RLTQAQVAAPPGENALEHSSDQDTWDSLRSPGPCSPLSSGGGAE 
SLPPGGPGHAEAGHLGKVCDFHLNHQQPSPTSVLPTEVAAPPLE 
KILSVDSVAVDCAYRTVPKPGPQPGPHGSLLTEGCLRSLSGDIiN 
RFPCGME VHS GQRELES WAVGEAMAVlKFPMGAMS vr T,Rnp Qp 
FLFRLPMGLSCPLQVQ 




2063 


737 


GVGSGAASAIARSRPLASRLSSRRRTRAPRSGAMQRIxAMDLRML 

srelslyiiehqvrvgffgsgvglsl i lgfsvayaf yylss lakk 
pqlvtggesfsrflqdhcpvvtetyyptvwcwegrgqtli*rpf\ 

ITSKPPVQYRNELIKTADGGQISLDWFDNDITSTCYMDASTRPTI 

lllpgltgtskesyilhmihlseelgyrcwfnnrgvagenllt 

PRTYCCANTEDLETVIinjVHSLYPSAPFLAAGVSMGGMI/LLNyL 

GKIGSKTPLMAAATFSVGWNTFACSESLEKPIiNWLLFNYYLTTC 

LQSSVNKHRHMFVKQVDMDHVMKAKSIREFDKRFTSVMFGYQTI 

DDYYrDASPSPRLKSVGIPVLCLNSVDDVFSPSHAlPIETAKQN 

P1^VAI.VLTSY6GHIGFLEGIWPRQSTYMDRVFKQFVQAMVEHGH 
ELS 


6809 


939 


65 


DYSGQTPVPTEHGMTLYTPAQTHPEQPGSEASTQPIAGTQTVPQ 
TDEAAQTDSQPliHPSDPTEKQQPKRIiHVSNlPPRFRnpnr.pnMip 

GQPOKILDVEIIFNERGSKGPGFVTFETSSDADRAREKLKGTIV 
EGRKIEVNNATARVMTNKKTGNPYTNGWKLNPWGAVYGPEFYA 
VTGFPYPTTGTAVAYRGAHLRGRGRAVYNTFRAAPPPPPIPTYG 

awyqdgfygaeiVleatqptdtlsplqrrqptatvtaestqlp 

TRTITPSGPRRPTALEPCETFHRFLLGP 


6B10 
6S11 


939 


65 


dysgqtpvptehgmtlytpaqthpeqpgseastqpiagtqtvpq 

TDEAAQTDSQPLHPSDPTEKOOPKRIiHVSNl PPP r-pripnT -ariMv 

gqfgkildvei ifnergskgfgfvtfbtssdadrarbklngti V 
eqrkievnnatarvmtnkktgnpytngwklnpwgavygpefya 
vtgfpypttgtavayrgahlrgrgravyntfraapppppiptyg 

AWYQDGFYGAE I \LEATQPTDTLS PLQRRQPTATVTAESTQLP 
TRTITPSGPRRPTALEPCETFHRPLI/5P 




1^22 


658 


DLVTVWSFVDCRVIASTHGH\KSWVSW7^DPYTTSVEBGDPME 
PSGSDEDFQDLLHPGRDRADSTQCRLSRRNSTDSRPVSVTYRFG 

svgqdtqlclwdltedilfphqplsrarthtnvmnatsppagsn 

GNSVTTPGNSVPPPLPRSNSLPHSAVSNAGSKSSVMDGAIASGV 

skfatlslhdrkerhhekdhkrnhsmghisskssdklnlvtktk 

TDPAKTLGTPLCPRMEDVPLLEPLICKKIAHERIiTVLIPLEDCI 
VTACQEG FI CTWGRPGKWS FNP 


6812 


4001 " ■ 


1582 

1 
] 
] 


iSDAVFSLDLSTIIQGrWPLWGEBLKSJ^PEGQVKPGAUlYRIEQ 
KGLQHRLIIUIAVKHQDSGAIjVGFSCPGVQDSAALTIQESPVHIL 
SPQDKVSLTFXTSERWIiTCELSRVDFPATWYKDGQKVEESELL 

wkmdgrkhrlilpeakvqdsgefecrtegvsaffgvtvqdppv 

HIV0PREHVFVHAXTSECVMLACEV\DR\EDAPVRWYKDGQEVE 

esdfwlbnegphrrlvlpatqpsdggefqcvagdecayftvti 
tdvsswivypsgkvyvaavrlerwltcblcrpwaevrwtkdge 
bwespalllqkedtvrrlvlpavqledsgeylce iddesasft 

mrrEPPVRliypRDEVTLIAVTIiECWLMCELSREDAPVRWYK 

dgleveesealvlerdgprcrlvlpaaqpedgqepvcdagddsa 

PFTVTVTEpPVQFIiALETTPSPliCVAPGEPWLSCELSRAGAPV 
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SEQ 
ID 
KO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide " 
(A«Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Hi3tidine, I=Isoleucine, K=Lysine, 
xj-xieucxne, M=Metnionj,ne, Ni^Asparagine , 
P=ProlinG, Q=»Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=j Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown, ♦=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








VWSHNGRPVQEGEGLELHAEGPRRVlKriQAAGPAHAGLYTCQSG 
xwirvj/vfiaijar J. vy vHiSrl^ vJiv VAi'CtAAQTRVRSTPGGDLELVVH 
LSGPGGPVRWYKDGERIASQGRVQLEQAGARQVLRVQGARSGjDA 

geylcdapqdsriflvsveepllvklvsdltpltvhegddatfr 
cevsppdadvtwlrngawtpgpqrqsccsyggcrmcgqrkart 

CVSKWRQAEWVQRGPCAGCEVGSPCPTTTACPWPRMGTSTASSS 
MVSYWPTRAPTAARArXIAPWPGSA 


6813 




836 


sstqqrpgvpagprpldgylgvadhkplkmhcrdcalvtssghl 
lhsrqgsqidqtecvirmmdaptrgygrdvgitrtslrviahssi 
qrilr2jrhdllwvsqgtvfi fwgpss ymrrdgkgqvynnlhiis 
qvlpri,kapmitrhkmiiqfdeijfkqetgq\nrkisntwiistgwf 
tmtialelcdr INVYGMGP pdfcrdpnhps vpyhyyepfgpdec 

TMYLSHERGRKGSHHRPITEKRVFKNWARTFNIHFFQPDWKPES 
LAINHPENKPVF 


6814 


3 


737 


KFRRQEAN/ARERNRMHGIJJIDAIiDNIiRKVVPCYSKTQKLSKIET"" 

URbAKNYIWALSEILRIGKRPDLLTFVQNLCKGLSQPTTNLVAG 

CLQLBTARSFLMGQGGEAAHHTRSPYSTPYPPYHSPELTTPPGHG 

TLDNSKSMKPYNYCSAYESFYESTSPECASPQFEGPLSPPPINY 

NGIFSLKQBBTLDYGKNYNYGMHYCAVPPRGPLGQGAMFRLPTD 

SHFPYDLIiLRSQSbTMQDEIiMAVPHN 


6815 


906 


553 


QGLDPASQXKWEIjLKDGSGRRGDRRSSRDMAGGAGPRSESDLE 
DVGPTAEWNGDGSGSLRRSGSFGKLRDALRRSSEMCVKKLQGGT 
PQEP PNPRMKRASSIiNFLNKSVBSPTQPGG 


68X6 


1 


803 


NLLICTHKF\I,l>GQDEDSLHSVPVAQMGNVQEYLKTrASPIJiEID 
PDQPKRLHTPGNPFKQDKKGMMIDEADEFVAGPQNKVKRPGEPN 
SPMSSKRRRSMSIiLLRKPQTPPTVTNHVGGKGPPSASWFPSYPN 
LIKPTLVHTDATIIHDGHEEKMENGQITPDGFLSKSAPSEIjINM 
TGDLMPPNQVDSLSDDFTSLSKDGIiIQKPCSNAFVGGAKNCSLS 

vddqkdpvastlgampntlqitpamaqginadikhqlmkevrkf 

GRSK 


6817 
6818 


172 
2 


3457 
240 


LGMMDSPKlGNGLPVIGPGTDIGISSIiHMVGYLGKNFDSAKVPS 

deycpackekgklkalktyrisfqesiflcedlqciyplgsksl 
nnlispdlbechtphkpqkrkslbssykdslllanskktrnyia 

IDGGKVLNTSKHNGEVYDETSSNLPDSSGQQNPIRTADSLBRNBI 
LEADTVD^3ATTKDPATVDVSGTGRPSPQNEGCTSICLE^5PLBSKC 
TSFPQALCVQWKNAYALCWIiDCILSALVHSEELKNTVTGLCSKE 
ESIFWRliLTKYNQANTLLYTSQLSGVKDGDCKKLTSEIFAEIET 
CLNEVRDEIPISLQPQLRCTLGDMESPVPAFPLLLKLETHIEKL 
PLYSFSMDFECSQCX3HQYQNRHMKSLVTPTNVIPEWHPLNAAHP 
GPCNNCNSKSQIRKMVLEKVSPIFMLHFVEGLPQNDLQKYAFHP 
EGCLYOITSVIQYRANNHFITWILDADGSWLECDDLKGPCSERH 
KKFBVPASEIRlVIWERKISQVTDKBAACLPLKKTNDQHALSKrS 
KPVSLTSCSVGDAASAETASVTHPKDISVAPRTLSQDTAVTHGD 
HLLSGPKGLVDNILPLTLEETIQKTASVSQLNSEAFLMjENKPV 

aentg iiiktntllsqeslmass vsapcwekl iqdqfvd isfpsq 
wntnmqsvqlntedtvntksvnntdatgrilqgvksveiekdaq 
lkqfltpkteqlkpervtsqvsnlkkkettadsqtttskslqnq 
slkenqkkpfvgs5jvkgl1srgasfmplcvsahnrntitdlqps 
vkgvnnfggfktkgiwqkashvskkarks/^kpppiskppagpp 
ssngtaahphahaasevleksgstscgaqlnhssygnfgissanh 

EDLVEGQIHKIJILKLRKKLKAEKJCKIJUW^MSS PQSRTVRSENLE 

QVPQDGSPJTOCES lEDLLNELP YP IDIANESACTTVPGVSLYSS 

QTHEBILAELLSPTPVSTELSENGEGDFRYLGMGDSHIPPPVPS 

EETOVSQNTHLRQDHKYCSPTKKNPCEVQPDSLTNNACVRTLNL 

ESPMKTDIFDEPPSSSALMAIiANDTLDLPHFDEYLPBNY 

RGPDKVLWT/LSGAVKNCVQFSRISPDGEEGYPGELKVWVTYTL 
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SEQ 
10 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal pepti<i^ 
(A=^anine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Hi3tidine, I=Isoleucine, K^Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, 0=Glutamine, R=Arginine, 
S=Serine, T-Threonine, VsValine, 
WxTryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6819 


1 


961 


dgge/lhs/attehkp/vqatpvnlt\tiltstwqarlpqi ■ 
g i pctemgnfdnanvtge xefaihycfkthslei ci kacknlay 
geekkkkcnpyvktyllpdrssqgkrktgvqrntvdptfqetlk 
yqvapaqlvtrqlqvsvwhlgtlarrvflgeviiplatwdfeds 

TTQSFRWHPLRAKADKYEDSVPQSNGELTVRAKLVLPSRPRKLQ 
EAQEGTDQPSLHGQLCIiWLGAKNLPVRPDGTLNSFVKGCLTLP 
BQQKLRLKSPVLRKQACPQWKHSFVFSGVTPAQLRQSSLELTW? 
DQALFGMNDRLLGGT\RLGSKGDrAVGGDACSQSKLQWQKVLSS 
PNLWTDMTLVLH • 


6620 


1014 


340 


GDMVYIVGHVPPGFFEKTQNKAWFREG^WEKYLKVVRIomWlA 
GQFFGHHHTDSPRMLYDDAGVPISAMFITPGVTPWKTTLPGWN 
GANNPAIRVFEYDRATLSLKDMVTYFMNLSQANAQGTPRWELEY 
QLTEAYGVPDASAHSMHTVLDRIAGDQSTLQRYYVYNSVSYSAG 
VCbEACSMQHVCAMRQVDIDAYTTCLYASGTTPVPQLPLLLM/iL 
LGLCT 


6821 


1088 


518 


EFDIYR/BVGGEFVPVTRDDSSNGFPRTQHGPSPTVHPIQSPQN" 
RFCVLTLD PETLPAI ATTLIDVL P V^HQl*© VT?lia O c c nt? rt o e TrT» 

FFAFSblEGYlXSIVMDAETQKKFPSDLLLTSSSGELWRMVRlG 
GQ PLG FDECGI VAQ I AG PLAAADI SAYYISTPNFDHAIiVPEDGl 
GSVIEVLQRRQBGLAS 


6822 


1088 


518 


EFDI YR/EVGGEFVP VTRDDSSWGFPRTQHGPS PTVHPI QSPQN 

RFCVLTliDPETLPAIATTLIDVLFYSHSTPKEAASSSPEPSSIT 
FFAFSIiIE(3YT\55TVMr>ai!'TnH'Vfi*tJGnT TT-TCOont^T rjTi»iit»o-r^ 

GQPLGFDECG I VAQIAGPLAAADI SAYYISTPNPDHALVPEDGl 
GSV lEVLQRRQEGLAS 


6823 


654 


221 


PPKLLSRWARMGHGDB I V\IiSDIiNFPGIJ:,HLPWGPWRSVQTAC ~ 
GiPQLLBAVLKIiLPLDTYVESPAAVMELVPSDKERGLQTPVWTE 
YESILRRACCVRAIAKIERFEFYERAKKAFAWATGETALYGNL 
ILRKGVLAIiNPLL 


6824 


858 


104 


IiLLAQRWGWG\ CCFFSLAVSVKMNVLLFAPGLLFLLLTQFGFRG 
ALPKLGICAGLQWLGLPFLLENPSGYLSRSFDLGRQPLFHWTV 
NWRFLPEALPLHRAFHIiALLTAHLTIiLLLFALCRWHRTGE S I LS 
LLRDPSKRKVPPQPLTPNQIVSTLPTSNFIGICPSRSLHYQPyv 
WYPHTLPYLLWAMPARWLTHLLRBliVLGLIELSWNTYPSTSCSS 
AAIiHI CHAVILLQIiWLGPQPFPKSTQHSKKAH 


6825 


3 


1173 


SSGEFGLQASDIMWTISDTGWILIILCSLMEPWAU3ACTFVHLL 
PKFDPLVIIiKTLSSYPIKSMMGAPlVYRMLLQQDLSSYKFPHLQ 
NCLAGGESLLPETLENWRAQTGLDIRE FYGQTETGLTOWS KTM 
KIKPGYMGTAAS CYDVQI IDDKGNVLPPGTEQDIGXRVKP IRPI 
GIFSGYVDNPDKTAANIRGDFWLLGDRGIKDEDGYPQFMGRADD 
IINSSGYRIGPSEVENALMEHPAWETAVISSPDPVRQEWKAF 
VILALOFLSHDPBQLTKELQQHVKSVTAPYKYPRKIEFVIiNLPK 
TVTGKIQRA\KLRDKEWKMSGKAPCAVRHIiRDIHLDSPLIiSLSF 
PFGPIiALPMDGYGDSLWEEHEYKFCLALVrSTKLYHVRC 


6826 


2304 


954 


LKTES FKPW/ VN lALAFHLLGBRAS PNSPWQP YIQTLPRE YDTP 
LYFEEDEVRYLQSTQAIHDVFSQYKNTARQYAYPYKVIQTHPHA 
NKLPLKDS PTYEDYRMAVSS VMTRQNQXPTEDGSRVTLAIiI PLW 
D^3CNHT^^GLITTGYNriEDDRCECVALQDFRAGEQIyIFYGTRSN 
AEFVIHSGFFFDNNSHDRVKI KLGVS KSDRL YAMKAEVLARAGI 
PTSSVFALHFTEPP I SAQLLAFLRVFCMTSEELKEHLLGDSAID 
RIFTLGMSEFPVSWDNEVKLWTFLEDRASLLLKTYKTTXEEDKS 
VLKNHDLSVRAKMAIKLRLGEKEILBKAVKSAAVNREYYRQQME 
EKAPLPKYSESNLGLLESSVGDSRLPLVLRNLEEEAGVQDAIiUI 
RBAISKAKATENGLVNGENSIPNGTRSENESIiNQESKRAVEDAK 
6SSSDSTAGVKE 



558 



wo 01/53312 



PCTAJSOO/34263 



SEQ 
ID 

NO: 

6827 


PrediGted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

nucleotide 

location 

to first 
amino acid 
residue of 
amino acid 
seqpience 


Amino acid eegmenc containing signal peptiHi" 
(A=.Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H«Hxstidine, I=Isoleucine, K=tLysine, 
LsLeucine, M=Methionine, N=Asparagine , 
P=Proline, Q=GluCamine, R=Arginine, 
S=Serine, T=:Thr eonine , v=Valine, 
"^Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /sspossible nucleotide deletion, 
\=possible nucleotide insertion) 


— g828 


1 


779 


SSWEFGLSVLGGLFLLFVLENMLGLLRHRGLRPRCCRRKRRNL 
ETRNLDPENGSGMALQPLQAAPEPGAQGQREKNSQHPPALAPPG 
HQGHSHGHQGGTDITWMVLLGDGLHNLTDGIiAIGAAFSDGFSSG 
LSTTLAVFCHELPHELGDFAMLLQSGLSFRRLLLLSLVSGALGL 
GGAVLGVGLSI.GPVPLTPWVFGVTAGVPLYVALVDMLPALFPSS 
_ _GAPAYA\HVLI^LGLLLGGCLMIAITIiEERLLPVTTEG 


6829 


3 


1654 


KSQHG/WIbQLMHSCKEGYVKDLKGNPGLHRAMLDLDNGTRPSE 
LGHLSQTASLKRGSSFQSGRDDTWRYKTPHRVAFVEKLTKLVLS 
QLPNFWKLWrSYVNGSLFSETAEKSGQIERSKNVRQRQNDPKJCM 
IQEVMHSLVKLTRGALLPLS IRDGEAKQYGGWBVKCELSGQWLA 
HAXQTVRLTHESLTALEIPNDIAQTIQDLILDLRVRCVMATLQH 
TAEEIKRIiAEKEDWIVDNEGIiTSLPCQPEQCIVCSLQSLKGVLE 
CKPGEASVPQQPKTQEEVCQLSINIMQVPIYCLEQLSTKPDADI 
DTTHLSVDVSSPDLFGSIHEDFSLTSEQRIiLIVLSNCCYLERHT 
PLN I AEHFEKHNFQGIEKITQVSMASLKBLDQRLFENY I ELKAD 
PIVGSLEPGIYAGYPDWKDCI.PPTGVRNYI,KEALVNIIAVHAEV 
PTISKELVPRVLSKVIEAVSEELSRLMQCVSSFSKNGALQARLK 
ICALRDTVAVYLTPESKSSFKQALEALPQLSSGADKKLLEBLLN 
' KFKSSMHLQLTCFQAASSTMMKT 


6830 


1 


782 ^ 


MKMEAGEAAPPAGAGGRAAGGWGKWVRLNVGGTVFLTTRQTLCR " 

BQKSFLSRLCQGEELQSDRDETGAYLIDRDPTYFGPILNFLRHG 

KLVLDKDMAEEGVLEEAEFYNIGPLIRIIKDRMBEKDYTVTQVP 

PKHVYRVLQCQEEBLTQMVSTMSDGWRFEOLVNIGSSYKYGSED 

QAEPLCWSKELHSTPNGLSSESSRKTKSTEBQLEEQQQQEEEV 

EBVSVEQVQVEADAQEK/CCYKPEAPGCEAPDHLQGLGVPI 


6831 


1 


939 


MiSPSSVBNLs IVYRSRDFLVVNKHWDVRIDSKAWRETLTLQkQL 
RYRFPELADPDTCYGPRFCHQLDFSTSCSALCVALNECAAAGSAyR 

cfkerrvtkaylallrghiqesrvtishaigrnstegrahtmci 
egsqgcenpkpsltdlwlehqlyagdpvskvllkpltgrthqi, 
rvXhcsalghpwgdltyqevsgredrpfrmmlhafylriptdt 

ECVEVCTPDPPLPSLDACWSPHTLLQSLDQLVQALRATPDPDPE 

drgprpgspsallpgpgrppppptkppeteaqrgpclqwlsewt 

LEPDS 


6832 


3 


1087 


SLFFGSSTPDKKVAEQEDLETQPSfe>SVEKAVTVlbPEGTIPTNF 

nvaekpadhslsevklktadeprgtlvksgdgqnvkeksmilsn 

VE0LQQPKPISEVSREDYGKKEISGDSESMNINSWTSADGENL 

eiqsyslighklvmeeaktivpphvtdskrvqkpaiappskwni 
sifkeeprsdqkqksllsfdwdkvpqqpksassnfasknitkb 
sekpesiilpveeskgslidfsedrlkkemqnptslkiseeetk 

LRSVSPTEKKXlNLENR\SYTb\AEKKVLAEKQNSV\APLELRDS 

neigktqitlgsrstelkeskadampqhpyqnedynerpkiivg 
sekekdeickkk 


6833 


1809 


412 

• 


mgsglisgpp«dnsgealkeperaqehslpnpaggqhffeyllv 

VSLKKKRSEDDYEPIITY0FPKRENLUi(5nnT?T?PPT?T t wrtist d 
CPPDGNE WASLTE YPRETFS FVLTNVDGSRKIGYCRRUJPAGPG 
PRLPKVYCIISCrGCFGLFSKILDEVEKRHQISMAVIYPFMQGL 
REAAFPAPGKTVTLKSPlPDSGl'EFISLTRPIiDSHLEHVDFSSL 
LHCLSFEQILQIFASAVLERrCIIFLAEGLSTLSQCIHAAAALLY 
PFSWAHTYIPWPESLLATVCCPTPFMVGVQMRFQQEVMDSPMH 
EVLLVNLCEGTPLMSVGDEKDILPPKLQDDILDSLGQGINELKT 
^EQ1NEHVSGPPVQFFVKIVGHYASYIKREANGQGHFQERSFCK 
WJTSKTNRRFVKKFVKTQLFSLFIQEAEKSKNPPAGYFQQiax.E 
fBEQKKQ/TETKGKNCEIRAWNKND 


- 


1 


1129 ] 
< 


^IjMTIiSQCGGJ.PGHGHSHGGHGHGHGLPKGPRVKSTRPGSSD5jr~ 

7APGEQGPDQEETWTLVANTSNSNGLKLDPADPBNPRSGDTVEV 

3VNGNLVREP0HMELEEDRAGQLNMRGVFLHVLG0ALGSVIVW 
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wo 01/53312 



PCTAJSOO/34263 



SBQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segmenc containing signal peptiaT" 
{A«Alanine, C=Cysteine, D^Aspartic Acid, E= 
Glutamic Acid, P= Phenyl alanine, G^Glycine, 

H— fli S fc id 1 TIP* T — T CtrtT iaii m «/ T _ _ 

^ ^ » X— xtooxeucme, K5=i>ysjLnG/ 
LsLeucine, M=Methionine, N=Asparagine , 
P=Proline, Q^Glutamine, R^Arginine, 
SsSerine, T*=Threonine , V=«Valine, 
W^Tryptophan, Y=TVrosine, X«Unknown, *=Sbop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








NAIiVFYFSWKGCSEGDFCVNPCFPDPCKAFVKIINSTHASVYEA 
GPCWVLYLDPTLCWMVCILIiYTTYPLLKKSALILLQTVPKQID 

IRNLIKELRir\^GVEE VHELH VWOTJVr:<513 T T aTBUT vr«TP-ririrTi Ci V 

mevaktikdvfhnhgihattiqpepasvgsksswpcelacrtq 
calkqcx:gtlpqapsgkdaektpavsisclelsnnlekkprrtk 
aenipa\wibikn\ipnk\qpessl 


6834 
6835 


78 


llSl 


AGQERPAPlWRLLWIiPTPSVSRKABPAHIPINR*GA*E*RGGLP " 

LCGSSASAYGWH*RLTPWSPGGS*HM*SSKAPVTQAREVIiVAGP 

CSKLVLSGARGIVGTTVQVLVEAQQPLLLLFTGVWGLNI,RAGEE 
SRAIj*LIEEVTO\/RDAHIiftMaVvr'r"&Ar'T cnnmrr'onT ^vh-pt 

•'WTHrf M^u4jv j,vv zijj^irtxjuvtM. V V tj<J>tLK- i->aUtjCJVK.S A T . A AIiT iR 

AAAAVRDCKBVLTVSGDKQQAEVSVRL^VRDVCVEEAGCVEFGQ 

AHGRPGLALAKGRGGTNBVEEQVQVDGVQKLVLSAHECHELVAG 

QQDGEDQAARTRLLQAGAHSVAHGRRQGQAPCRPHQEAGVSCHE 

LQQWGDAl,*ARE*APQriVLLLLEDVAQLRTGKKA*DLWDVE 
QLLRQL 


6836 


1 


834 


GlPAADRXEASLELIKLDISRTFPNLCIFQQGGPYHDMijlglriG" 

AY 1 t-iRPDVGYVQGMSFIAAVLILNLDTADAFIAFSNLLNKPCQ 

MAFFRVDHGLMLTYFAAFEVFFEEMTLPKLPAHFKKNNLTPDIYL 

IDWXFTLySKSLPLDItACRIWDVFCRDGEBFLFRTALGILKLFE 

DILTKMDFIHMAQFIiTRI^PBDLPAEELFASIATIQMQSRNKKWA 

QVLTALQKDSREMREGKSVPPTLRLQRSFALGTNQSPMPRPLCC 

FRLTPGQPRRTDAI* 


6837 


1 


850 


M5COKPPPDVDGMITLKV\DNLTVrT^PDSLRRVFEKYGRVGDV " 

YI preejhtkaprgfafvrfhdrrdaqdaeaamdgaeldgrelrv 
qvarygrrdlprsrqgrrhaagpeaa/rygrrsrsygrrsrspr 
rriirsrsrgpscsrsrsrsryrgsrysrspysrspysrsrysrs 
pysrsryresryggshysssgysnsrysryhssrshsksgssts 

£sk*a:> i i»K5>i>SARRSKSSS VSRSRSRSRSSSMTRS PPRVSKRKS 

ksrsrskrppkspeeegqmss 


6838 


1 


1369 


TUGAAVAGNPGSDYPPGGTAPyGGPRTRRP\s6tSSSGSKASGP 

pnppaqgdgtslspnytlestsgndgkpvsggggrgrgrrkrds 

w** V w s- u A 4; i/xs-x ^''vii:'iy£>v3^y/ii;'tj Ptauy yAoGAAVGGSSAGET 
RGAPTPHBKALTSPSWGKGAELLLGDQPDLIGSLDGGAKSDSSS 
PKVGEFASDEVSTSYANEDEVSSSSDNPQALVKASRSPLVTGSP 
KLPPRGVGAGEHGPKAPPPALGLGIMSWSTSTPDSYGGGGGPGH 
PGTPGLEQVRTPTSSSGAPPPDEIHPLEILQAQIQLQRQQFSIS 
EDQPLGLKGGKKGECAVGASGAQNGDSELGSCCSEAVKSAMSTI 
DLDSLMAEHSAAWYMPADKALVDSADDDKTLAPWEKAKPQWPNS 
KEAHDLPANKA5ASQPGSHLQCLSVHCTDDVGDAKARASVPTWR 
SLHSDI SNRF6TFVAALT 


6839 


15 


499 


bTDTPPPKTHMIHHSISDYKATLRCWALGFYPMEITIiTWQQDEE 
DQTRDMELVETRPAGDGTFOKWAAWVPSrjpp /n /t? VMr-uirriui? 

GLPEPLTLRWEQSSQPTIPIVGIVAGLVLLGAWTGAWSAVMC 
RKKNSDRVSYSEAASSDHAQGSDVSLTACKV 




1 


1195 


AAPAGGGPDPEALSAFPGRHLSGLSWPQVKRLDALLSEPIPIHG 
RGNPPTLSVQPRQIRAGGPQHPGCAGMttVHRVRLHGSAASHVL 
HPESGIiGYKDLDLVFRMDLRSEASFQiTKAWLACLLDFLPAGV 
SRAKITPLTLKEAYVQKLVKVCTDSDRWSLISLSITKSGKWVELK 
FVDSVRRQFEFSIDSFQIILDSLLLFGQCSSTPMSEAFHPTVTG 
ESL YGDPTEALEHLRHRVI ATRS PEE I RGGGLLKYCHLLVRGFR 
PRPSTDVRALQRYMCSRFFIDFPDLVEQRRTLERYLEAHFGGAD 

jwuiryaclvtijirwneswcij^herrqtijdliaaialqalae 

2gpaataaiamrppgtdgvvpatvnyyvtpvqpllahayptwlp 

::n 




2061 ] 


SLiQGDFS VPDVPKSMAWCENS ICVGFKRDYYL IRVDGKGS IKEL 
"^PTGKQLEPLVAPLADGKVAVGQDDLTWLNEEGICTQKCAIiNW 
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wo 01/53312 



PCTAJSOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A-Alanine, C==Cysteine, Dispart ic Acid, E= 
Glutamic Acid, F=: Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine^ 
L=Leucine, M=M6thionine, N=Asparagine, 
P=iProline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threonine, Vt=Valine, 
W=.Tryptophan, Y=Tyrosine, X=Unknown, *=stop 
Codon, /=posBible nucleotide deletion, 
\=possible nucleotide insertion) 








TDIPVAMEHQPPYIXAVLPRYVEIRTFEPRLLVQSIELQRPRFI 

TSGGSNI I YVASNHFVWRLI PVPMATQIQQLLQDKQPSLALQLA 
EMKDDSDSEKQQQ IHH I KNLYAPNLPCOlfP PnPQMmrT?^ vtj^tt\ 

PTHVMGLYPDLriPTDYRKQLQYPNPLPVLSGAELEKAHIiALIDY 
LTQKRSQLVKKIiNDSDHQSSTSPLMEGTPTIiCSKKKLLQIIDTT 
LLKCYIiHTNVAIjVAPLLRLENNHCHIEESEHVLKKAHKYSELI I 
LYEKKGLHEKALQVLVDQSKKANSPLKGHERTVQYLQHLGTENL 
HLIFSYSVWVLRDFPEDGLKIFTEDLPEVESLPRDRVLGFLIEN 
FKGLAI P YLEHI IHVWEETGSRPHNCLIQLYCSKVQGLMKE YLL 
SFPAGKTPVPAGEEEGELGEYRQKLLMPLEISSYYDPGRLICDP 

ppdglleeralllgrmgkheqalpiyvhilkdtrmaeeychkhy 

DRNia)GNKDVYLSLLRMYLSPPSIHCLGPIKLEIiIiEPKANLO?UV 
LQVI.ELHHSKLDTTKAI.NbLPANTQINDIRIFLSKVI»EENAQKK 

rfnqvlknllhaeflrvXqeerilhqqvkciiteekvcmvckkk 
ignsaparypngvwhyfcsXkevnpadt 


6841 


1 


3206 


TPSTTGTKSNTPTSSVPSAAVTPLNESLQPLGDYGVGSkNSKRA 
REKRDSRNMEVQVTQEMRNVSIGMGSSDEWSDVQDIIDSTPELD 

mcpetrldrtgssptqgivnkapgintdslyhelstagsevigd 
vdegadllgefsgmgkbv6niille»sqllbtkrai*nwkndliia 
kvdqlsgeqevlrgeleaakqakvklenrikeleeelkrvksea 
iiarrbpkeeaedvssylctbsdkipmaqrrrftrvemarvlme 
rnqykerlmelqeavrwtemirasrehpsvqekkkstiwqffsr 
lfssssspppakrpypsgnihykspttagfsqrrnhamcpisag 
srplepfpdddctssarreqkreqyrqvrehvrnddgrlqacgw 
slpakykqlspnggqedtrmknvpvpvycrplvekdptmklwca 

AGVWIiSGWRPNEDDAGNGVKPAPGRDPLTCDREGDGBPKSAHTS 
PEKKKAKBLPEMDATSSRVWILTSTLTTSKWirDANQPGTWD 
QFTVCNAHVIiCISSI PAASDSDYPPGEMPLDSDVwPTrnDr ann\/ 

iagitlvgcatrcnvprsncssrgdtpvldkgqgbvatiangkv 
npsqsteeateatevpdpgpsepetatlrpgpltehvftdpapt 
pssgpqpgsengpepdssstrpbpepsgdptgagssaaptmmlg 
aqngwlyvhsavanwkkclhsiklkdsvlslvhvkgrvlvalad 
gtlai fhrgedgqwdlsnyhlmdlghphhsircmawydrvwcg 
yknkvhviqpktmqieksfdahprresqvrqlawigdgvwvsir 

LDSTLRLYHAHTHQHIX3DVDIEPYVSKMI/3Tnin.nPCT7\ro tts t 
LVAGSRLWVGTGNGWISIPLTBTVVLHRGQ\LLG\1JIANKTSP 
TSGEG\ARPGG\ I IHVyG\DDSSDRAARSFIPyCSMAQAQLCFH 

ghrdavkffvsvpgnviiatiingsvldspaegpgpaapasevegq 
klrnvlvlsggegyidfrigdgeddetesgagdmsqvkpvlska 

ERSHI I VWQVSYTPE 


6842 


3 


926 


kcoqlsatiltdhqylertplcailkqkapqqyriraklrsykp 
rrlfqsvklhcpkchliiqevphegdldixfqdgatktpdvklqn 
tslydskiwttknqkgrkvavhfvknngxlplsneclllieggt 

LSEICKLSNKFNSVIPVRSGHEDDELLDLSAPFL3QGTVHHYGC 
KQWST*RSIQNLNS1jVDKTSWIPSSVAEALGIVPLQYVFVMTPT 
LDDGTGVLEAYLMDSDKFFQ I PASEVLMDDDLQKSVDMlMDMPC 
PPGIKIDAYPMI.ECFIK3YMVTNGTDNQICYQIFDTTVAEDVI 


6843 
6S44 


2 


SSI 


NHRKVLSGAKkY£CNECGKSPAYTSSLIKHRRIHTGERPYECSE~ 

CGRSFAENSSLIKHLRVHTGERPYECVECGKSPRRSSSLLQHQR 

VHTRERPYECSBCGKSFSLRS^rLIHHQRVHTGERHECGQCGKSF 

SRKSSLIIHljRVHTGERPyECSBCGKSFAEWSSLriCHLHVHTGE 

RPYECIDCX5KSFRHSSSFRRHQRVHTGMRPYK*SKFWICPSCPGF 

LLLQGQRVHTGSRCYECDKWGIFFS*NASFFT*KSAPTEEVPPE 

CNECEKAPSPIiSLVTTlFT 




244 


642 


EHQIiAGFKLRKTQTSMSIiGTTRBKTDRVKSTAYLSPQELEDVFY 
OTVKSEIYSFGIVLWEXATGDIPFQGCNSEKIRKLVAVKRQQE 



561 



wo 01753312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C«Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histxdine, I«Isoleucine, K-Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, RaArginine, 
S=Serine, T=Threonine, V=Valine, 
W« Tryptophan, Y^Tyrosine, X=*Un]cnown, *«Stop 
Codon, /^possible nucleotide deletion, 
\«pos3ible nucleotide insertion) 








PLGEDCPSEIiREIlDECRAHDPSVRPSVDEILKKriSTFSK*CIK 
I 


€845 


3 


1519 


VAVRDECYWRHVFWDQDLWMLtiFIIiMCHPETARARLEYRIRTLD 
GALENAQNL6YQGAKFAWESADSGLEVCPEDIYGVQEVKVNGAV 
GLAFELYYHTTQDLQLFREAGGWDWRAVAEFWCSRVEWSPREE 
KYHLRGVMS PDByHSGVNNSVYTNVtjVQNSLRFAflALAQDr,GljP 
IPSQWtiAVADKIKVPFDVBQNFHPEFDGYEPGEWKQADWLLG 
YPVPFSLSPDVRRKNIjE I YEAVTS PQGPAMTWSMFAVGWMELKD 
AVRARGliLDRS FANMAEPFKVWTENADGSGAVNFLTGMGGFLQA 

WFGC1Y?FPVT'RRnVTPno\7r'T.Qr2T QPXrOlforiTE»vnr»XTVT ht-cc-c 
V V c n. V J. tsjwav ± r iJt' v ^Ju&v»XoiC vo Vov»J. r lU^MJUjff £ SF 

SEDSVTVEVTARAGPWAPHLEAELWPSQSRLSLLPGHKVSFPRS 
AGRIQMSPPKLPGSSS S EPPGftTFSDVRDPLQS PLWVTLGSSSP 
TESLTVDPAS E *SGTGASETSLGPSLWPRLHPPLLGTLLACHPS 
PAARLSGKVHAAWPBFKAFCL 


6846 


213 


1258 


LYFIjKTIK*IiNRIjAEHP*YENEKLTKLRNTIMEQyTRTEESARG " " 

1 1 F TKrrKQSAYALSQW I TENEK FAE VG VKAHHL I GAGHS SEFK P 

MTQNBQKBVISKPRTGKINLLIATTVAEEGLDIKECNIVIRYGL 

VTNEIAMVQARGRARADESTYVLVAHSGSGVIEHBTVNDFREKM 

MYKAIHCVQNMKPBEYAHKILSIiQMQSIMBKKMKTKRNIAKHYK 

NNPSLlTFLCKNCSVIiACSGEDIHVIEKMHHVNMTPEFKEX.YIV 

RBNKTIiQKKCADYQINGEirCKCGQAWGTMMVHKGUJLPCLKIR 

NFWVPKNNSTKKQYKKWVELPITFPNriDYSECCLFSDED 


6847 


1450 


348 


SMCWNSDRLEMPLXDLAIilLYPPSYVPYTGHIiSDDSLSRKYCL'I* " 
WFBDALNGVL*RAEAIQPHCVNAGDRMEKFRQKYWNKLQTIJIQQ 
PFAYGTLTVRSLLDTREHCLNE FN FPD PYS KVKQRENGVALRCF 
PGVVRSIXtAIiGWEERQIiALVKGLLAGNVFDWGAKAVSAVIjESDP 
YFGFEEAKRKLQERPWLVDS YSEWLQRLKGPPHKCALl PADNSG 
ID 1 1 liGVFPFVRELLIiRGTEVlIiACNSGPALNDVTHSESLIVAE 
RIAGMDPWHSALREERLLIiVQTGSSSPCLDLSRLDKGIAAIjVR 
ERGADLWIEGMGBtAVHmSfHAAIiRCESLKLAVI KNAWLAERLG 
GRLFS VT PKYF VPAP 


6848 


19 


16 


AMWWNSLDGIRNIVLSWPKKRNTLSLAMLKSLQSDIIJIDADSND 
LKVII XSAEGPVFS SGHDLKELTEEQGRDYHABVFQTCSKVMMH 
IRKHPVPVIAMVKGLATAAGCQLVASCDIAVASDKSSFATPGVN 
VGLFCSTPGVAIARAVPRKVAIiEMIiFTGEPISAQEAI*LHGLLNK 
WPEABLQEETMRrTU^IASLSRPWSIiGKATFYKQIiPQDLGTA 
YYLTSQAMVDNIiALRDGQEGITAFLQKRKPVWSHEPV*VEH 


6849 


70 


821 


SLGVDGSCLEQGSPAPRPQTDTSP * P VGNWATQQEDLYHQS YEC 
VCVLFASVPDFKEF YSESNINHEGLECLRLLNE I lADFDELLS K 
PKPSGVEKIKTIGSTYMAATGLNATSGQDAQQDABRSCSHLGTM 
vefavalgskldvtnkhs fnnfrlrvg lnl^p wag vigaqkpq 
YDIWGNTVNVASRMESTGVIiGKIQVTEBTAWAIjQSIjGYTCYSRG 
VIKVKGKGQI*CTYPLNTDLTRTGPPSArLG 


6850 


2 


1235 


ARGIiNHE WTFEKIiRQHI SRNAQDKQELHIjFMLSGVPDAVFDLTD " 
LDVIjKLELIPEAKI PAKISQMTNLQELHLCHCPAKVEQTAFS PL 

rdhlrclhvkftdvaeipawvyujknlrelylignlnsennkmi 
gleslrelrhlkilhvksnltkvpsnitdvaphltklvihwdgt 

KLLVLnSLKKMMKVAELELQNCELERIPHAIFSLSNLQELDLKS 
NNIRTl EEI IS FQHLKRLTCLKLWHNKI VTI PPS I THVKNLESL 
YFSNNKLESLP VAVFSLQKLRCLDVS YNNISMI P IE IGLLQNLQ 
HLHITGNKVDILPKQLFKCIKLRTLNLGQNCITSLPEKVGQLSQ 
LTQLELKGNCLDRLPAQLGQCRMLKKSGLWEDHLFDTLPLEVK 
EALNQDINIPFANGI 


6851 


176S 


660 


VSAQVSAREQENCLGWNLADSSQESYKSIiEEAEDCYPPSLLTLD 
LRDLFWQVEQGPH.SCPKAGTDLSMGRARBVGWMAAGLMIGAGA 
CYCVYKLTIGRDDSEKtiEEEGEEBWDDDQELDEEEPDIWFDFET 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
cor re spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence « 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide " 
(A=Alanine, C^Cysteine, D=Aspartic Acid, E= 
Glutamic Acid. F» Phenyl alanine, G=Glycine, 
H=Histidine, l=Isoleucine, K=Ly3ine, 
L=Leucine , M=Methionine , N=:Asparagine , 
P^Proline, Q^Glutamine, R=Arginine, 
SsSerine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=:Stop 
Codon, /=:^possible nucleotide deletion, 
\«possible nucleotide insertion) 








MARPWTEDGDWTEPGAPGGTEDRPSGGGKANRAHPIKQRPFPYE ' 
H KNT WS AQNCKNGS CVLDbSKCL FI QGKIiL FAE PKDAG FP FSQD 
INSHLASLSMARNTSPTPDPTVREALCAPDNLNASIESQGQIKM 
YINE VCRETVSRCCNS FLQQAGLNLLISMTVINNMLAKS AS DLK 
FPLISEGSGCAKVQVLKPLMGLSEKPVLAGELVGAQMLPSPMSL 
PIRWGNREILLETPAP 


6852 


1 


407 


RTRG EETYANP I KHNDG KNI FYAARTPATLFAVM FAMY 1 1 SGIiT " 
GPI GLNS lAVLCNLVMGLADI FIiCTWAYVKYSGEFRE IGTVI DQ 
IAETLWEQVIjKPLGDNLMEENIRQS VTNS IKAGLTDQVS HHARL 
KTD 


6853 


3 


469 


GDS CAVCIELYKPNDL vr iltcnhi phktcvdpwllehrtcpmc 

KCD ILKALG IE VDVEDGSVSLQVPVSNE I PETSASSHEEDNRSBT 
ASSG YAS VQGTYE PPLEEHVQS TNE SI*QLVNHBANS VAVD VI P H 
VDNPTFEEDETPNQETAVREI KS 


6854 


1143 


585 


HESYIGTFDPGELCVCAAIQWLQDNSASYFLNRKLVYEPSTQAK 
PVKNTFLRMWIYSHHIYQQDlJRKKIIiDVGKRLDVrGFCMTGKPG 
I ICVEGFKEHCEE FWHT I RYPNWKHIS CKHAESVETEGNGEDLR 
LPHS PEELLLEAHGDYGLRNDYHMWIiGQFLEFLKKHKSEHVFQI 
LFGIESKSSDS 


6855 


1913 


1148 


GRVGGRVGRICSPLSGANEYIASTDTLKTEEVLLFn>QTDDLAK " " 
EEPTSLFQRDSETKGESGIiVIiEGDKEIHQlFEDLDKKliALASRF 
YIPEGCIQRWAAEMWALDAiaREGIVCKDliNPNNILLIIDRGHI 
QLT YFSRWSE VEDSCDSDAI ER MYCAPE VGA I TEE TE ACDWWS L 
GAVLFELLTGKTLVECHPAGINTHTTLNMPEWVSEEARSLIQQL 
LQFNPLERLGAGVAGVEDI KSHPFFTPVDMAELMR 


6656 


1617 


' 997 


VTQLYVSVDASTKDSLKKIDRPLPKDPWQQFLDSLKAliAVKQQR 
TVYRLrLVKAWNVDELQAYAQLVSLGNPDFIEVKGVTYCGESSA 
SSLTMAHVPWHEEWQFVRELVDIiIPEYEIACEHEHSNCLLIAH 
RKFKIGGEWWTWINYKRFQELIQEYBPSGGSKTFSAKDYMARTP 
HWALFGASERGFDPKDTRHQRKNKSKAISGC 


6857 


1 


617 


KGPEATAMVCVCSHPNCRQNHtKPSHSAAQTMCGSPTPASAPNH 
KLMAMEQGKTLPSATEDAKEEGLEJ^Q ISRLAELIGRLESKALWF 
DLQQULSDEDGTNMHLQLVRQEMAVCPEQLSBFLDSLRQYIjRGT 

tgvrncphitavrlsdgptfviyefweteeawkrhlqsplckaf 
rhvkvdtlsqpealsrilvpaawctvgrd 


6858 


2 


6S9 


rsrgikdfendpplsscgifqsriagdalldsgirissvfaspa ■ 
lrcvqtaklileelklekkikirvbpgifewtkweagkttprlm 

SIiEELKElANFN IDTD YRPAFPLSALMPAES YQE YMDRCTAS MVQ 
IVNTCPQDTGVILIVSHGSTLDSCTRPLLGLPPRECGDFAQLVR 
KIPSLGMCFCEENKEEGKWBLVNPPVKrLTHGANAAFNWRNWIS 
GN 


6859 


1 


1150 


GETMFKKAKTKAKKKPRKRSDSSGGYNIiSDlIQSPSSTGLLKSG 
KTKSVESLPEIiLTSDSEGSYAGVGSPRDLQSPDFTTGFHSDKIE 
AKVKPYVNGTSPVYSREDLKPWEKSPILKISAPQPIPSNRIDTT 
SSASWVAGSPSPVSPPWDLRTIMEIEESRQKCGATPKSHLGKT 
VSHGVKLSQKQRKMIALTTKENNSGMNSMsrviiFTPSKAPKPVJT 
AWASS LHS VS SKSPRDFLLEEKKSVTSHSSGDHVKKVSFKG I EN 
SQAPKIVRCSTHGTPGPEGNHISDLPLUDSPNPWLSSSVTAPSM 
VAPVTFASIVEEEriQQEAALIRSREKPLALXQIEEHAIQDLLVF 
YEAFGNPEEFVIVERTPQGPLAVPMWKKHGC 


6860 


1889 


151S 


DKDKKRQKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQIiAQ 
DTGLTI LQVNNWPINARRI IVQPMIDQSNRAVSQGAAYSPEGQ P 
MGSFVLDGQQHMGIRPAGPMSGMGMNMGMDGQWHYM 


6861 


1889 


1515 


DKDKKRC3IKKRGIFPKVATNIMRAWLFQHLTHPYPSEEQKKQLAQ 
DTGLTI LQVNNWPINARRI I VOPMIDQSNRAVSOGAAYSPEGQP 
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SEQ 
ID 
NO : 


Predicted 
beginning 

location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D^Aspartic Acid, E« 
«xutaniic AciUf r^fnenyxaianme, o=G±yc3.ne/ 
HisHistidine, I=Isoleucine, K^Lysine, 
LoLeucine, M=Methionine, N^Asparagine, 
PsProlxne, Q=Glut amine, R^Arginine, 
S=*Serine, T=Threonine» V^Valine, 
W=Tryptophan, Y=:Tyrosine, X==Unknovm, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possii>le nucleotide insertion) 








MGSFVLDGQQHMGIRPAGPMSGMGMNMGMDGQWHYM 


£S€2 


2 


471 


EEIDREFHNKLKIiKEDKLEKQEKPVNGEDKGDSGVDTQNSEGNA 
DEEDPLGPNafYDKTKSFFDNISCDDNRERRPXWAEERRLNAET 
FGIPIiRPKRGRGGYRGRGGLGFRGGRGRGGGRGGTPTAPRGFRG 
GFRGGRGGRE FADFEYRKTTAFG P 


6863 


2216 


487 


PQEPALKSEFSQVASNTIPLPLPQPNTCKDNGPCKQVCSTVGGS 

ill Jb^ I AX rWUJla V o UliUy Uii 

y CVNHTVIiCADGY ILNAHRKCVDI MSCVTDLHTCSRGEHCVNTIj 
GSFHCYKALTCEPGYALKDGECEDVDECAMGTHTCQPGFLCQNT 
KGSFYCQARQRCMDGFLQDPEGNCVDINECTSLSEPCRPGFSCI 
imraSYTCQRNPLICARGYHASDDGTKCVDVNECETGVHRCGEG 
QVCHNLPGSYRCDCKAGFQRDAFGRGCIDVNECWASPGRIiCQHT 
CENTLGSYRCSCASGFLLAADGKRCEDVNECEAQRCSQECANIY 
GSYQCYCRQGYQLAEDGHTCTDIDECAQGAGILCTFRCLNVPGS 

yqcacpeqgytmtangrsckdvdecalgthncseaetchnicxss 
frclrpecppwyvqvsktkcerttchdflecqwsparithyqln 

FQTGLIiVPAHI FRIGPAPAFTGDTI ALNI IKGNEEGYFGTRRI^N 
AYTGWYLQRAVLEPRDFALDVEMKLWRQGSVTTPLAKMHIFFT 
TFAL 


6B64 


2 


2933 


LADSSPSNLQIIIKELLSMHHQPDPALrKEFDYLPPVDSRSSSG 

fvglrnggatcymnavfqqlymqpglpesllsvdddtdnpddsv 

FYQVQSLFGHLMESKLQYYVPENFWKIFKMWNKEIjYVREQQDAY 
EFFTSLiIDQMDEYLKKMGRDQIFKNTPQGIYSDQKICKDCPHRY 
ereeapmalnlgvtscqsle r SLDQ FVRGEVT.EGSNAY yce kck 

EKRI TVKRTC I KSIiPSVLVIHIWRPG FDWBSGRS I KYDEQ I RFP 
WiynbNMEPYTVSGMARQDSSSEVGENGRSVIDQGGGGSPRKKVALT 
ENYEbVGVIVHSGQAHAGHYYSFIKDRRGCGKGKWYKFNDTVlB 
EFDLNDETLEYECFGGEYRPKVYDQTNPYTDVRRRYWNAYMLPY 
QRVSDQNSPVLPKKSRVS WRQEABDI/SLSAPSS PE I SPQ S SPR 
PHRPNNDRLS ILTKLVKKGEKKGLFVEKMPARIYQMVRDENliKF 
MK^mDVYSSDYFSFVLSrJ^LWATKLK^^'YYPCMAKVSLQLAIQ 

FLFQTYLRTKKKIjRVDTBEW r atieai*lsksfdacqwi*veypi s 

SEGRELIKIFLLBCNVRBVRVAVATIIiEKTLDSAbFYQDKLKSL 
HQLLBVUiALLDKDVPENCKNCAQYFFLFNTFVQKQGIRAGDLL 

VLHSDVSSQRNVAPGIFKQRPPIS LAPS SPLLPLHEEVEALLPM 
SBGKPYLLEVMFAliRELTGSLLALIEMWYCCFCNEHFSFTMLH 
FIKNQbETAPPHELKNTFQLLHEiriVIEDPIQVERVKFVFETEN 
GLLALMHHSNHVDSSRCYQCVKFLVTDAQKCPAAKEYFKENSHH 
WSWAVQWIiQKKMSEHYWTLQSNVSNETSTGKTFQRTISAQDTIiA 
YATALLNEKEQSGSSNGSESSPANENGDRHLQQGSESPMMIGEL 
RSDLDDVDP 


6865 


1820 


1242 


DPERWKHLSKVTPPGSSVSTTPVQWRLQSPQSQGSMMPSCNRS 
CSCSRGPSVEDGKWYGVRSYLHLFYEGYAVPPKLEGIGEGEFLV 
LDQRAADYKQALGTCRLAGTALCVAAGVLIiAICLFWAMtGWLSQ 
DTKAEPLDPSADSHVEVFGDEPEQQLSPIFRNASGQSWFSPPAS 
PFGQSSVQTIQPKRDS 


6866 


1571 


495 


DCPRPRVTLYGLRATCMRDLDWAWINAVSAFKALEQDLPVNIKF 
I lEGMEEAGS VALEELVEKE KDRFPSGVDYIVISDNLWI SQRKP 
AITYGTRGNSYPMVEVKCRDQDFHSGTPGGILHEPMADLVALLG 
SLVDSSGHILVPGIYDEWPLTEEEINTYKAIHriDLEEYRNSSR 
VEKFDFDTKEEILMHLWRYPSLSIHGIEGAFDEPGTKTVIPGRV 
IGKFS IRLVPHMNVSAVEKQVTRHLEDVFSKRWS SKKMWSMTL 
GLHPWXANIDDTQYIiAAKRAIRTVFGTEPDMI RDGSTI P lAKMF 
QEIVHKSWLIPLGAVDDGEHSQNBKINRWNYIEGTKLPAAFFL 
EMAQLH 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corre spending 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Qlycine, 
H=Histidine, I=Isoleucine, k=Lysine, 
L-Leucine, M^Methionine, N=Asparagine ^ 
P=Proline, Q=Glutamine, R=Arginlne, 
S=Serine, T=Threonine, V=5Valine, 
W^Tryptophan, Y=Tyroeine, X=Unknown, *«»stop 
Codon, /^possible nucleotide deletion, 
\s=possxbl6 nucleotide insertion) 


6867 


2833 


1704 


GTRIMSQPKQKELAGFVRQKiyiLLDYSVyMGRCVPQESRSPQRSP 
LQSAESSPTAGKKLPEVPPSEEEEQEAWVNAILGRIFWDFLGEK 
yWSDLVSKKIQMKLSKIKIiPYFMNELTLTELDMGVAVPXILQAP 
KPYVDHQGLWI DLEMS YNGS PLMTLETKMNLTKLGKEPLVEALK 
VGE IGKEGCRPRAPCLADS DEESS S AGSSEEDDAPEPSGGDKQL 
LPGAEGYVGGHRTSKIMRFVDKITKSKYFQKATETEPIKKKIBE \ 
VSNTPLLLTVEVQBCRGTIiAVNIPPPPTDRVWYGFRKPPHVELK 
ARPKLGEREVTLVHVTDWIEKKLBQEFQKVFVMPNMDDVYITIM 
HSAMDPRSTSCLLKDPPVEAADQP 


6868 


1 


346 


RPTRPPTRPEBIKWLILPYISDMWFVQDLCEDFyELFKTDKGFD 
KATFESQMS VMRGQILNLTQAIiRDGKSPFQLVQ I PCVI VERSQG 
GSQGRI VHLSKS FTQTVNCRKPFPSS W 


^869 


3 


1619 


MYMBRMDICRALIS FWES VEHLKNANKNEIPQIjVGE I YQNFFVES 
KEIS VEKSLYKE I QQCLVGNKGI EVFYKIQEDVYETLKDRYYPS 
PIVSDLYBKLLIKEBEKHASQMISNKDEMGPRDEAGEEAV0DGT 
NOINEQASFAVNKLRELNBKLEYKRQALNSIQNAPKPDKKIVSK 
LKDEI ILXEKERTDIjQLHMARTDWWCBNLGMWKAS ITSGEVTEE 
MGEQLPCYFVMVS LQEVGGVETKNWTVPKRLSE FHNLHRKLSEC 
VPSLKKDQIiPSLSKLPFKS IDHTFMEKPENQLMKFIiQNIiIjS DER 
LCQSEALYAPLSPSPDYLKVIDVQGKKNSFSLSSFLERItPRDPF 
SHQEEETEEDSDLSDYGDDVDGRKDAtiAEPCFMLIGEI FEI>RGM . 
FKWVRRTLl ALVQVTFGRT INKQIRDTVSWI PSEQMLVYYINI F 
RDAFWPNGKLAPPTTIRSKEQSQETKQRAQQKLLENIPDMLQSL 
VGQQNARHGIIKIFNALQETRANKHLLYALMELLLIELCPELRV 

mil /wmvrwjar^ v 


6870 
• 


1 


1566 


MAAWAATRWWQLLLVLSAAGMGASGAPQPPNILLLriMDDMGWG 
DLGVYGEPSRETPNIiDRMAAEGLLFPNFYSANPIiCSPSRAAIiLT 
GRLP IRNGPYTTNAHARNAYTPQEI VGGI PDSEQLLPELLKKAG 
YVSKIVGKWHLGHRPQFHPLKHGFDEWFGSPNCHFGPYDNKARP 
ITIPVYRDWEMYGRYYEEFP INLKTGEANLTQI YLQEAIiDFI KRQ 
ARHHPFFLYWAVBATHAPVYASKPPLGTSQRGRYGDAVRBIDDS 
i J. Li liljijyiijjH VALJN 1 tfM Ft! SDNGAAIj I S APEQG6S NGP FI*C 
GKQTTPEGGMREPAIAWWPGHVTAGQVSHQLGSIMDLFTTSLiAL 
AGLTPPSDRAIDGLNLLPTLLCJGRLMDRPIFYYRGDTIiMAATLG 
QHKAHFWTWTNSWEWFRQGIDFCPGQNVSGVTTHNLEDHTKLPL 

QLNVCNWAVMNWAPPGCEKLGKCLTPPESIPKKCLWSH 


6871 


209 


1126 


RMSLNPPI PLKRSEENSSKFVETKQSQTTS lASEDPLQNLCLAS 
QEVLQKAQQSGRSKCIiKCGGSRMFYCYTCYVPVENVPIEQIPLV 
KLPLKIDIIKHPNETDGKSTAIHAKLLAPEFVWIYTYPCIPEYE 
EKDHEVALIFPGPQSISXKDISPHLQKRIQNNVRGKNDDPDKPS 
PKRKRTBBQEFCDLWDSKCKGTTLKKIIFIDSTWNQTNKIFTDE 
RLQGLLQVELKTRKTCFWRHQKGKPDTPLSTIEAIYYFIiVDYHT 
DILKEKYRGQYDNl.LFFYSFMYQIiIKWAKCSGDKBTGia.TH 


6872 


880 


459 


FGLLMWLS LI FMKGNCVREDLI FNFLPKIiGLDVRETNGLPGNT 
KKLITEyPVRQKYLEYRRIPYTEPAEYEFLWGPRAFLETSKMLV 
LRFLAKLHKKDPQSWPFHYLEALAECEWEDTDEDEPDTGDSAHG 
PTSRPPPR 


6873 


1929 


955 


DEQAVLCSKDKTYDLKrADTSNMLLF I PGCKTPDQLKKEDSHCW 
I IHTE I FGPSNNYWELRRRRPKLKKLKKIjLMENP YEGPDSQKEK 
DSWSSKYTTEDLLDQIQASEEEIMTQLQVLNACKIGGYWRILEF 
DYEMKLLNHVTQLVDSESWSFGKVPLNTCLQELGPLEPEEMIEH 
CLKCYGKKYVDEGEVYFELDADKICRAAARMLLQNAVKFNLAEF 

qevwqqsvpsgmvtsldqlkglalvdkhsrpeiifllkvddlpe 
dnqbrfnslfslrekwteediapyiqdlcgekqtigalltkysh 
ssmqngvkvynsrrp r s 
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SEQ 
ID 
NO: 


"""Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C^Cysteine, D^Aspartic Acid, 
Glutamic Acid, FsPhenylalanine, G=Glycine, 
H=Histidlne, I=Isoleucine, K=Lysine, 
L=Leucine, Ms:Methionine^ N^Asparagine , 
P=Proline, Q=Glutamine,. R=^Arginine, 
S=S erine, T«= Threonine, V= Valine, 
W«Tryptophan, Y=Tyroaine, X«UnJcnown, *=Stop 
Codon, /=pos3ible nucleotide deletion, 
\=possible nucleotide insertion) 


6874 


1 


307 


DSIADHVNSAAVNVEEGTKNLGKAAKYKLflALPVAGALIGGMVG 
GP I GLLAGFKVAGXAAALGGGVLGFTGGKLTQRKKQKMMEKXiTS 
SCPDLPSQTDKKCS 


6875 


1688 


349 


VIGTGERGNSASEKWEIMFNEELGDPFIIIHSISLI^AEEHSIA 
TliLLRIEKEELDMKGSGFYVSIiEWVTISKKNQDNKKYEIIKRDI 
LRGKSVPHyAAIEPDGNGLMIVSYKSLTFVQAGQDIiEENMDEDI 
SEKI KEPLY YWQQTED0LTVTI RLPEDNTKED I QIQFLPDHINI 
VUCDHQFLEGKLYSS IDHESSTWIIKESNSLE I SLI KKNEGLTW 
PELVIGDKQGELIRDSAQCAAIAERLMHLTSEELNPNPDKEKPP 
CNAQELEECDIFFEESSSLCRFDGNTLKTTHWWLGSNQYLFSV 
IVDPKEMPCFCLRHDVDALLWQPHSSKQDDMWEHIATFNALGYV 

Sjn^£yj\urijfi.c r/\V-*ttxriN i to I/\rtJLiwJ5L.JjKKVr J. XKyi'AFMfa 1 VJjYNR 

KEGRQVGQVAKQQVASLETNDPILGFQATNERIiFVLTTKNLFLI 
KVNTBN 


5876 


41 


1285 


VGEMTIjlWRHIiIjRPrjCLVTSAPRILEMHPFLSIiGTSRTSVTKI.S 
LHTKPRMPPCDFMPERYQVIPLVNSGSEANSLAMLMARAHSNNI 
DIISFRGAYHGCSPYTLGLTNVGIYKMELPGGTGCQPTMCPDVF 
RG PWGGSHCRDS P VQTIRKCS CAPD CCQAKDQ Y I EQFKDTIjSTS 

vaksiagpfaepiqgvngwqypkgflkeafelvrarggvcian 
evqtgfgrlgshfwgfqthdvlpiuvtmakgigwgfpmaavitt 
peiakslakclqhfntfggnpmacaigsavlevikeenlqbnsq 
bvgtymiilkfaklrdefeivgdvrgkgiimigiemvqdkiscrpl 

PREEVNQIHEDCKHMGLLVGRGS I FSQTFRIAPSMCITKPE VDF 
AVE V FRS ALTQHMERRAK 


6877 


1 


778 


GTS PSPARAYAPPTERKRFYC3iNVS ITQGEGGFEINLDHRKLKTP 
QAKLPTVPSEALAIAVATEWDSQQDTIKYYTMHLTTLCNTSLDN 
PTQRNKDQLIRAAVKFLDTDTICYRVBEPETLVEUQRNEWDPII 
EWAEKRYQVE ISSSTSIMGPS IPAKTREVLVSHLAS YNTWALQG 
IEFVAAQLKSMVLTLGLIDLRLTVEOAVIiLSRLEEE YQ X QKWGN 
lEWAHDYBLQEIiRARTAAGTLFIHLCSESTTVKHKLLKE 


6878 




263 


t^± iiyvyf x» ]\J?ii<JiaiyixiJJraJ,K± iLN V 1 KiiiJAfoKlKCE VSAPSEQGQN 
LEEDTVTIiEVLVAPAVPSCEVPSSALSGTWELRCQDKEGNPAP 
EYTWFmSIRLLENPRLGSQSTNSSYTMNTKTGTLQFNTVSKIiD 
TGEYSCEARNSVGYRRCPGKRMQVDDr^ISGIIAAVWVALVtS 
VCGIiGVCYAQRKGYFSKETS FQKSNS SSKATTMSBNDFKHTKSF 
II 


6879 


3 


845 


I RVIGESDIMOE FL SESDEW yWK Vfi D VR LP VAL pnR'TTUT ' 
KNSTTDQVYQAIAAKVGMDSTTVNYFALFEVISHSPVRKLAPKTE 
PPHKLYIQNYTSAVPGTCLTIRKWLFTTEEEII.LNDNDLAVTYF 
FHQAVDDVKKGY I KAEEKS YQIjQKL YEQRKMVMYLNMLRTCEGY 
rJEIIFPHCACDSRRKGHVlTAISITHPKLHACTEEGQLENQVrA 
FEWDEMQRWDTDBEGMAFCFEYARGBKKPRWVKIFTPYFNYMHE 
CFERVFCELKWRKBBY 


6680 


2110 


1437 


RKDNCTAKEWTFPEAKWWTTARVFSHIRLGMGHVLIIVQCFISS 
MANIYNEKILKEGNQLTESIFIQNSKLYPFGILFNGLTLGLQRS 
NRDQI KNCGFFYGHRAPSVALIFVTAFQGLSVAFILKFLDKMFH 
VLMAQVTTVIITTVSVLVFDFRPSLEPFLEAPSVLIjSIFIYNAS 
KPQVPEYAPRQERIRDIiSGNLWERSSGDGEELERLTKPKSDESD 
EDTP 


6861 


2638 


2244 


NDSKWSDIHVITGALKMFFRELPEPLFrFNHFWDFVNAIKQEPR 

qrvaavkdlirqlpkpnqdtmqilfrhlrrviengekmrmtyqs 
laivfgptllkpeketgniavhtvyqnqivelillelssifgr 


6882 


1 


850 


GI PEAQLWiyPVKSCKGVP VS EAECTAMGLRSGNLRDRFWIiVIN 
QBGNMVTARQBPRLVLISLTCDGDTLTLSAAYTKDIir/LPlKTPT 
TNAVHKCRVHGLEIEGRDCGEATAQWITSFLKSQPYRLVHFEPH 
MRPRRPHQIADLFRPKDQ lAYSDTS PFLI IiSEASLADLNSRLEK 
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SEQ 
ID 
NO; 


Predicted 

beginning 

nucleotide 

location 

c or re sp ond i ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


AlTli.no acid SPam&Ht com t*A"i t>"i ncr fiicmal 'Thisvr^t- -1 rf^tt 

(A?=Alanine, G= Cysteine, D«sAspartic Acid, E=5 
Glutamic Acid, F« Phenyl alanine, G=Glycine, 
H«Histidine, I=Isoleucxne, K=Lyslne, 
L=I*eucine , M=Methionine , N=Asparagine , 
P= Pro line, Q=: Glut amine , R^Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan , YaTyros ine , Xi=Unknown, * ^^S top 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








KVKATNFRPNIVISGCDVyAEDSWDELIilGDVELKRVMACSRCI 
LTTVDPDTGVMSRKEPLETLKSYRQCDPSERKLyGKSPriFGQYF 
VLENPGTIKVGDPVYLLGQ 


6883 


2794 


2256 


NSKLKLNQNLKLFITLTYQVLSLHGWGPGIHLQKEGAFPVTQNR 
ALQLLYDLRYLKIVLTAKGDEVKSGRSKPDSRIEKVTDHLEALI 
DPFDLDVPTPHLNSNLHRLVQRTSVLFGLVTGTENQLAPRSSTF 
NSQEPHNILPLASSQIRFGIjIjPLSMTSTRKAKSTRNIETKAQYD 
ANC 


6884 


2 


99 


EFERVTAEAVKPRETSEPRAAAQRFCEKPPFL 


6885 


297 


1554 


STGQFWHVTDLHLDPTYHrTDDKTiC\/CASSKGANASNPGPR3DV 
LCDS PYQLILSAFDF I KNSGQEAS FMI WTGDS PPHVPVPELSTD 
TVINVITNMTTTIQSLFPNLQVFPALGNHDYWPQDQIiSWTSKV 
YWAVANLWKPWLDBEAISTLRKGGF^SQKVTTNPNLRIISIiNTN 
LYYGPNIMTLNKTDPAMQPEWLESTLNNSQQNKEKVYIIAHVPV 
GYLPSSQNITAMREYYNEKLIDIFQKYSDVIAGQFYGHTURDSI 
MVIiSDKKGSPVNSIiFVAPAVTPVKSVLEKQTNNPGIRLFQYDPR 
DYKLLDMLQYYIiNLTEANLKGESIWKLEYILTQTYDIEDLQPES 
L YGLAKQPT ILDSKQP I KY YNYFFVS YDSSVTCDKTCKAPQ I CA 
IMNLDNISYADCLKQLYIKHNY 


688S 


2 


134X 


QCGG I PGREGGSSRPLEEGTGSSPACVRGAAPGSEDAF YPTRAK 
QARVSQELKKAAKRTVS ISEGPDTLGDGMRERRETLALAPEPEP 
LEKEACEKWKRPFRSASATSLTLSHCVDWKGLLDFKKRRGHSI 
GGAPEQRYQIIPVCVAARLPTRAQDVLDAHLSEVWAVRFGPNSS 
LLATGGADRLIHLWNWGSRLEANQTLSGAGGSITSVDPDPSGY 
QVLAATYNQAAQLWKVGEAQSKETLSGHKDKVTAAKPKLTRHQA 
VTGSRDRTVKEWDLGRAYCSRTINVLS YCND WCGDHI I ISGHH 
DQKIRFWDSRGPHCTQVIPVQGRVTSLSLSHDQLHLLSCSRDNT 
LKVIDIjRVSNIRQVFRADGFKCGSDWTKAVFS PDRS YALAGS CD 
GALYIWDVDTGKLESRLQGPHCAAVNAVAWCYSGSHMVSVDQGR 


68S7 


1047 


116 


WTARPSQKPPWEAGAVPGDPLSTGCSQAQLGGCCPRGPWGPQHG 
GQQRAAGPTLPRGERGGPQQSGPGLAAQTPPTSKQVAWRAFLTG 
TYRSQS PRSPAGP PRGGTGWWPEPAVCLCVAVGPQRLS S PGLVY 
nasgsehcydiyrlyhscadptgcgtgpdarawdyqactelniit 
FASNNVTDMFPDLPFTDELRQRYCLDTWGVWPRPDWLLTSFWGG 
DLRAASNI 1 FSNGNLDPWAGGGI RRNLS AS Vr AVTI QGGAHHLD 
IiRASHPEDPASWEARKLEATI IGEWVKAARREQQPALRGGPRL 
5L 


€888 


1 


992 


FVAYVKKEI PHI WTHCLLNPHALVI KTLPTKLRDALFTWRVI 
NFIKGRAPNHRLPOAFFBEIGIEYSVLLFHTEMRWLSRGQILTH 
IFEMYEE INQPLHHKSSNLVDGFBNKEFKIHIiAYLADLFKHLNE 
LSASMQRTGMNTVSAREKLSAFVRKFPFWQKRIEKRNFTNFPFL 
EBI IVSDNEGIFIAAEITLHLQQLSNFFHGYFS IGDLNEASKW I 
LDPFI*FNI DFVDDS YLMKNDIiAEIiRASGQ I LME FETMfCLEDFWC 
AQFTAFPNLAKTALEILMPFATTYLCELGFSITFTFQNKVPEAA 
IiILSDDIRVArSKKVPSFLGHH 


6889 


1 


X534 


LTLENQIKEEREQDNSESPNGRTSPLVSQNNEQGSTLRDLLTTT 
AGJCLRVGSTDAGIAFAPVYSMGAPSSKSGRTMPNILDDIIASW 
ENKI PPSKTS KINVKPBIiKEEPEES I IS AVDENNKLYSDI PHSW 
ICEKHILWLKDYKNSSNWKLFKECWKQGQPAWSGVHKKMNriSL 
WKAESISLDPGDHQADLLNCKDSI rSNANVKEFWDGPEEVSKRQ 
KNKSGETWLKLKDWPSGEDFKTMMPARYEDLLKSLPLPEYCNP 
EGKPNLASHLPGFFVRPDLGPRLCSAYGVVAAKDHDIGTTIILHI 
BVSDWNILVyVGIAKGHGILSKAGILKKPEEEDIiDDILRKRLK 
DSSEIPGALWHIYAGKDVDKIREPLQKISKEQGLEVLPEHDPXR 
DQSWYVNKKLRQRLLEEYGVRTWTLIQFLGDAIVLPAGALHQVQ 
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PCT/USOO/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
<A=Alanine, C«Cysteine, D^partic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, Kti^Lysine. 
L=Leucine, M=Methionine, N^Asparagine, 
P=Prol ine , Q=Glutaraine , R=Arginine , 
S=5erine, T=Threonine, V= Valine, 
W^Tryptophan, Y«Tyrosine, X=:trnknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








NFHSCIQVTEDFVSPEHLVESFHIiTQELRIjLKEEINYDDKLQVK 
NILVHAVKEMVRAIiKIHEDEVDDMEEN 


S890 


3 


667 


THACGMWI PLYLMRALWHKTAETCNSPP CGAKDS h I FGAITCF 

TGFLGVDTGAGATRMCRLKTQRADPLVCAVGMLGSAI FICLI FV 

AAKSSIVGAYICIFVGETLLFSNWAITADILMYWIPTRRATAV 

ALQSPTSHLLGDAGSPYLrGPlSDLIRQSTKDSPLWEFLSLGYA 

LMLCPFWVLGGMFFLATALFFVSDRARAEQQVNQIiAMPPASVK 
V 


S891 


i980 




LR IHOELLS KELKLLRG I TI E S i 1 H IRT Ari If POPwrnn^' QNTirMri 
LLLKTQSHLYMMEDWNPEVRQAAAYGLGVMAQFGGDDYRSLCSE 
AVPLLVKVIKRAHSKTKKNVIATENCISAIGKILKPKPITCVNVD 
EVLPHWLS WLPLHEDKEEAIQTIiS FLCDL lESNHP Wl GPNNSN 
LPKirsiIAEGKINETINYEDPCAKRLANWRQVQTSEDLWLEC 
VSQLDDEQQEALQELIiNFA 


6892 


3 


876 


RSVAAASGPGAWGTDHYCLELLRKRDYEGYLCSLLLPAESRSSV 

QPVAISLWKAVKRHNLTKRWLMKIVDEREKNLDDKAYRNIKELE 
NYAENTQSSLIiYLTLEILGIKDIJlADHAASHIGKAQGlVTCLRA 
1 in\^iaiAt(.R.v u JjPMUXCMIjHGVSQEDFLRRNQDKNVKDVIYDIA 
SQAHLHLKHARSFHKTVPVKAFPAFLQTVSLEDFLKKIQRVDFD 
IFHPSIiQQKNTLIiPLYLYIQSWRKTY 


6893 


1 


842 


DGBRKSMSVERTFSEINKT^EQYSLCQELCSEIAQDLQKKRLKG 
RTVTI KLKNVNFEVKTRASTVSS W5 TAEE I FAI AKELLKTEID 
ADFPHPLRLRLMGVRIS S FPNEEDRXHQQRS I IGPLQAGNQALS 
ATECTLEKTDKDKFVKPJLEMSHKKSFFDKKRSERKWSHQDTFKC 
BAWKQSFQTSQPFQVLKKKMNENLEISENSDDCQiriTCPVCFR 
ACXSClSIiEALNKHVDECLDGPSISENFKMFSCSHVSATKVNKKE 
NVPAJS SIiCEKQD YEAH 


6894 


1742 


1463 


TTLCKPLVPREHQFYBTLPAEMRKFTPQYKGKSQLLEGLPHWRG 
DVRDRGHGRPWQPSIiEPSLPPTIiCFPSLSSFSSSWPSAQHLTPS 
VFNPW 


6895 


2379 


478 


VTY VE LCDLAS PTALLIMRTVLDLI VEDLQSTSEDKEQQYTSQT 
rRLLALIiYAIASHKACKLAILHLINGTIKGDERYAEIPQDLIAL 
VRS PGDS VI ROOC VE YVTS I LOS IjCDOTi T A I . T T .p «i Q Q pr- <5 T e B-T 

EQIiSNSIiPNKEIiMTSICDCLIiATLANSESSYNCLLTCVRTMMFli 
AEHDYGLFHLKSSLRKNSSAI.HSLLKRWSTFSKDTGELASSFL 
EFMRQIIiNSDTIGCCGDDNGIjMEVEGAHTSRTMSIWAAELKQLL 
QSKEESPENLFLELEKLVLEHSKDDDNLDSLLDSWGLKQMLES 
SGDPLPLSDQDVEPVLSAPESLQNLFWNRTAYVIiADVMDDQLKS 
MMPTPFQAEEIDTDLDLVKVDLIELSBKCCSDFDLHSELERSPIj 
SEPSSPGRTKTTKGPKLGKHKHETFITSSGKSEYIEPAKRAHW 
PPPRGRGRGGPGQGIJRPHDIFRQRKQKTSRPPSMHVDDFVAAES 
KEWPQDGIPPPKRPLKVSQKISSRGGFSGNRGGRGAPHSQNRP 
FTPPASKGNYSRREGTRGSSWSAQNTPRGNYNESRGGQSNFNRG 
PLPPLRPLSSTGYRPSPRDRASRGRGGLGPSWASAWSGSGGSRG 
KFVSGGSGRGRHVRS FTR 


6896 


1 


555 


GNIVlQKKKYNKQHIIPLENVTIDSIKDEGDLRNGWIilKTPTKS 
FAVYAATATEKSEIVMNHXNKCVTDLI/SKSOKTPSNEHAAVWVPD 
SEATVCMRCQKAKFTPVNRRHHCRKCGFWCGPCSEKRFLLPSQ 

sskpvricdfcydllsagdmatcqparsdsysqslksplndmsd 
ddddddssd 


6897 


3 




920 


GDGLMHEVVNGLMERPDWETAIQKPLCSLPAGSGNALAASIiNHY 
AGYEQVTNEDLLTWCTLLLCRRLLSPMNLLSLHTASGIiRLFSVIj 

slawgpiadvdlesekyrrlgemrftlgtflrlaalrtyrgrla 
ylpvgrvgsktpaspvwcx2gpvdahlvpleepvpshwtvvpde 

DFVLVLATiLHSHLGSEMFAAPMGRCAAGVMHLPYVRAGVSRAML 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 

amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
cor re spond i ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amano acad segraenc containing signal peptiae 
{A=Alanine, C^^Cysteine, D^spartic Acid, 
Glutamic Acid, P= Phenyl alanine, G=Glycine, 
H«Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N«Asparagine, 
P=Proline. Q=Glutamine, R=Arginine, 
SsSerine, T=Threonine, v=Valine, 
W:=Tryptophan, Y=Tyrosin€, X=Unknown, *:=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LRLFIAMEKGRHMEYECPYLVYVPWAFRLEPKDGKGVF^VDGE 
LMVSEAVQGQVHPNYFWMVSGCVEPPPSWKPQQMPPPEEPL 


6898 


919 


346 


QKTVTAVA5H*KGRQGIYTENERRMC^AVIKIRPFKIMLVLIICW 

rjSWIINESLLFYLEMQTDINGGSLKPVRTAAKTTWFIMGXLNPA 

QGFLLSIiAFYGWTGCSLGFQSPRKEtQWESLTTSAAEGAHPSPL 

MPHENPASGKVSQVGGQTSDEALSMLSEGSDASTIEIHTASBSC 
NKNEGDPALPTHGDL 


6899 


120 


827 


MKVRKNNDAYLLDKNKINMDCFISCPFKKI^TTLMFSHSGILSL 

LEHGEEYTFSIiPCAYARSILTVPWVELGGKVSVNCAKTGYSASI 

TFHTKPFYGGKLHRVTAEVKHNITNTWCRVQGEWNSVLEFTYS 

NGETKYVDLTKLAVTKKRVRPLEKQDPFESRRLWKWVTDSLRES 

EIDKATEHKHTLEERQRTEERHRTETGTPWKTKYPIKBGDGWVY 
HKPLWKIIPTTQPAE 


6900 
6901 


3 


4SX 


TEVI^SKGIHELRSSTSALHHALEESASLLTMFWRAALPSTHIP 
VLPGKVGESTEREi:iLEr,RTKVSCJQEQLLQSTTEHLKNANQQKES 
MEQFIVSQLTRTHDVLKKARTNLEVRKLIiHQSEAPSLSPTHHHP 
LRDLVGDSWPALRFQEK 


6902 


1 


201 


DDNMVQRLBTDFKMrLQQQSTLEQWAAWLDNVMMQALKPYEGRP 
SFPKAARQFLLKWSPYRYHLGFS 




2 


267 


GAPPPPPSQPFK«PPQAAPSSHPHSDLTFNPSSALEGC2AGAQGA 
SDMPEPSLDLLFELTNPDEliLSYLDPPPLPSNSNDDLLSLFENN 


6903 
6904 


1 


149 


RINQVyRQGPTGIHILVIDQMVQNFQDESCFLFSTVKAES'^BGl" 
HULK 


6905 


464 


2092 


Mh^ASliPVSLSCVLACGDVEGKFDILPigRVQAIQKKSGNPDLIiLC 

VGNFFGSTQDAEWEEYKTGIKKAPIQTYVLGANNQETVKYFQDA 

DGCELAENITYLGRKGIPTGSSGLQIVYLSGTESLNEPVPGYSP 

SPKDVSSLRMMLCTTSQPKGVDILLTSPWPKCVGWFGNSSGEVD 

rKKCGSALVSSIiATGUCPRYHFAAIiEKTYYERLPYRNHIILQEM 

AQHATRFIALANVGNPEKKKYLYAFSIVPMKLMDAAELVKQPPD 

VTENPYRKSGQEASIGKQILAPVEESACQFPFDLNEKQGRKRSS 

rGRDSKSSPHPKOPRKPPQPPGPCWPCLASPEVEKHLWNIGTH 

CYIiALAKGGLSDDHVLILPIGHYQSWELSAEWEEVBKYKATL 

RRFFKSRGKWCWFERNYKSHHLQLQVIPVPISCSTTDDIKDAF ' 

ITQAQEQQIELLBIPEHSDIKQIAQPQAAYFYVELDTGEKLFHR 

IKKNFPLQFGREVLASEAILNVPDKSDWRQCQISKEDEETLARR 

FRECDFEPYDFTLDD 




1 


226 


VSKTGEAETITSHYLFALGVYRTLYLPNWIWRYHFEGFFDLIAI 
VAGLVQTVLYCDFFYLYrTKVLKGKKLSLPA 


6906 


3 


611 


SYDDHNGHIDFITAASNLRAKMYSIEPADRFKTKRIAGKilPAI 
ATTTATVSGLVALEMIKVTGGYPFBAYKNWFLNXiAIPIWFTET 
TEVRKTKIRNGISFTIWDRWTVHGKEDFTLLDFINAVKEKYGIE 
PTMWQGVKMLYVPVMPGHAKRIiKriTMHKLVKPTTBKKYVDLTV 
SFAPDIDGDEDLPGPPVRYYFSHDTD 


6907 


2 


2226 

: 

1 
I 


LRGVPVWAAGAFRFS S GEESTSHLIMSRRSQR t,TRYSQGDDDGS 
SSSGGSSVAGSQSTLPKDSPLRTLKRKSSNMKRLSPAPQLGPSS 
DAHTSYYSESLVHESWFPPRSSLEELHGDANWGBDLRVRRRRGT 
GGSESSRASGLVGRKATEDFLGSSSG YSS EDDYVGYSDVDQQS S 
SSRLRSAVSRAGSLLWMVATSPGRLFRLLYWWAGTTWYRtiTTAA 
SIiIiDVFVLTRRFSSLKTFLWFLLPIiLLLTCLTYGAWYFYPYGLQ 
TFHPALVSWWAAKDSRRADEGWEARDSSPHFQAEQRVMSRVHSL 
ERRLBALAAEFSSNWQKEAMRLERLELRQGAPGQGGGGGLSHED 
rLALLEGLVSRREAALKEDFRRETAARIQEELSALRAEHQQDSE 
DLPKKIVRASQESEARIQQLKSEWQSMTQESFQESSVKELRRLE 
DQIiAGLQQELAAlALKQSSVAEEVGLLPQQIQAVRDDVESQFPA 
tflSQFLARGGGGRVGLLQREEMQAQLRELESKlLTHVAEMCJGKS 
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SEQ 
ID 
NO : 


Predicted 
beginning 
nucleot ide 
location 
corresponding 
to first 
amino acid 
residue of 

amino ar*id 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, 
Glutamic Acid, F^Phenylalanine, G-Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
LsLeucinev M^Methionine, NsAsparagine , 
P=Proline, Q=Glut amine, R=Arginine, 
StsSerine, TaThreonine, V=Valine, 
W^Tryptophan, Y«.Tyrosine, X=»Unknown, *-Stop 
Codon, /spossible nucleotide deletion, 
\=possible nucleotide insertion) 








AREAAASLSLTtiQKEGVIGVTEEQVHHIVKQALQRYSEDRIGUi. 
DYAriESGGASVISTRCSBTYETKTAI.LSLFGlPriWYHSQSPRVI 
LQPDVHPGNCWAFQG PQG FAWRL SARIRPTAVTLEHVPKALS P 
NSTI S SAP KDFAI FGFDEDLQQEGTLLGKFT YDQDGEP I QTFHF 
QAPTMATYQWELRILTNWGHPEYTCIYRPRVHGEPAH 


€908 


3 


780 


QVPSAAWLMAVCGLGSRliGLGSRLGLQGCFGAARLLYPRFQSRG 
PQGVEDGDRPQPSSKTPRIPKIYTKTGDKGFSSTFTGERRPKDD 
QVFEAVG'rTDELSSAIGFALELVTEKGHTFAEEIiQKlQCTLQDV 
GSALATPCSSAREAHLKYTTFKAGPILELEQWIDKYTSQLPPLT 
AFILPSGGKISSALHFCRAVCRRAERRWPLVQMGETDANVAKF 
LNRLSDYLFTLARYAAMKEGNQEKIYKKNDPSAESEGL 


S909 


3 


409 


GRLUVVGTDIiYGQRSSAPEQELLVQnATPVSNSLLPEKAFSDIP 
SPYLRGTIKMMQAVRQAFQDQDDRRTWDGRPLTI^IAATFDDCLYA 
LCWnXIKRSSQTGEWQNIAIMTEEPELSPAYLISEAMRRSRMS 
LYC 


S910 


1 


1068 


LVPVWIDSYYYGKLVIAPLNIVLYNIFTPHGPDLYGTEPWYFY 
LINGPLNFNVAPAIAI*LVr*PLTSLMEYLLQRFHVQNLGHPYWLT 
LAPMYIWFIIFPIQPHKEERFLFPVypLICLCQAVALSAIiQHSF 
LYFQKCYHFVFQRYRLEHYTVTSNWLArXSTVFLFGLIjSFSRSVA 
LFRGYHGPLDLYPEFYRIATDPTIHTVPEGRPVNVCVGKEWYRF 
PSSFIiLPDNWQLQFIPSEFRGQLPKPFAEGPLATRIVPTDMNDQ 
NLEEPSRYIDISKCHYLVDLDTMRETPREPKYSSNKEEWISLAY 
RPPLDASRSS KLLRAP YVPFLSDQYTVYVNYT ILKPRKAKQIRK 
KSGG 


6911 


1184 


966 


GEDAEEMETGN VANLI S I FGSSPSQIiLRKSPGGGREEEEGEESG 
PEAAEPGQICCDKPVLRDMNPWSTAIVAF 


6912 


1 


844 


AMKPVETHSFQMIiFTILSTGSALKAQSYEDAYRCIKSSIL.IiGSI 
SGGTDI IS CFMGHNFS LPVYKGEI QARNLGMAVEAWNEEGKAVW 
GESGEIjVCTKP IPCQPTHFWNDENGNKYRKAYFS KFPGIWAHGD 
YCR INPKTGG I VMLGRSDGTLNPNGVRFGS SE I YWI VE S FE E VE 
DSLCVPQYWKYREERVILFLKMASGHAFQPDLVKRIRDAIRMGL 
SARHVPSLILBTKGIPYTLNGKKVEVAVKQIIAGKAVEQGGAFS 

vlirthL LiULi X t\U X cr JSii^Uur c 


6913 


1643 


. 1558 


KKSHEESHKEELSYGAQASLPLPCSDFR 


6914 


1251 


615 


EIiAAECKSAGYPGTLIPYRCDLSNEEDILSMPSAIRSQHSGVDI 

U X JN CUUaijAKFDTIjLi SGSTS GW KDMFNVNW 

ERNVDDGHI INI NSMSGHRVLPLS VTHFYSATKYAVTALTEGLR 

QELREAQTHIRATCISPGWETQFAPKIjHDKDPEKAAATYEQMK 

CLKPEDVAEAVI YVLSTPAH IQIGDIQMRPTEQVT 


6915 


254 


652 


GRSLS FKTFLI WVLIS IYQGGILMYGALVLFESEFVHWAI SFT 
ALILTELI^VALTVRTWHWLMVVAEFLSLGCYVSSriAPLNEYFD 
VAFI TTVTFLWKVSAI TWSCLPLYVLKYIiRRKLS P PS YCKLAS 


€916 


254 


652 


GEiSLS PKTFI* I WVIjIS I YQGGILMYGALVLFESEF VHWAI S FT 
ALILTELLMVALTVRTWHWLMWAEFLSLGCYVSSLAFLNEYFD 
VAFITTVTFLWKVSAITWSCLPLYVIiKYriRRKLS P PS YCKIiAS 


6917 


254 


652 


GRSIiSFKTFLIWVXjISIYQGGILMYGALVLFESEFVHWAISFT 
ALILTELLMWlLTTOTWHWLMWAEFIiSLGCYVSS LAFLNE YFD 
VAFITTVTFLWKVSAITWS CLPLYVLKYLRRKLS P PS YCKLAS 


6918 


28 


921 


PEAGTRSWRE PDPEDLRRFLLSAACRSPPQWLPGGGGGQVSS CS 
DTDVPYLLLAVKSEPGRFAERQAVRETWGSPAPGIRriljFLIiGSP 
VGEAGPDLDSLVAMESRRYSDLLLWDPLDVPPNQTLKDLLLLAW 
LGRHCPTVSFVLRAQDnAFVHTPAIilAHIiRALPPASARSLYLGE 
VFTQAMPLRKPGGPFYVPESFPEGGYPAYASGGGYVIAGRIAPW 
LLRAAARVAPFPFBDVYTGLCIRALGLVPQAHPGFLTAWPADRT 
ADHCAPRMLLLVRPIiGPQAS IRJLWKQLQDPRLQC 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, K= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H«Histidine, I=Isoaeucine, K^Lysine, 
I»=Leucine, M-Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginxne, 
SsSerine, T=:Threonine, V= Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *«Stop 
Codon, /::=posaible nucleotide deletion, 
\-possible nucleotide insertion) 


6919 


650 


41 


QGRRELSGSVFCPFIQQEPKEMIiTLSEYHERVRSQGQQLQQLQA 

EtiDKIaHKEVSTVRAAWSERVAKI*VFQRLNEDFVRKPDYALSSVG 

ASIDLQKTSHDYADRNTAYFWNRFSFWNYARPPTVILEPHVFPG 

NCWAPEGDQGQWIQLPGRVQLSDITLQHPPPSVEHTGaANSAP 

RDPAVFFLLSFFTHQGLQVYDETEVSLGKFTPDVEKSBIQTFHL 

QNDPPAAFPKVKIQILSNWGHPRPTCLYRVRAHGVRTSEGAEGS 
AQGPH 


6920 


1418 


591 


EAQGPSKVHLTLKKKK 


6921 


2 


1711 


i^uvAiKistiiiyt H V XWilAJiyTIiRKMENYLKEKQLCDVIjLIAGHLRI 
PAHRLVLSAVSDYFAAMFTNDYLEAKQEEVRMEGVDPNAIiNSLV 
QYAYTGVLQLKEDTIESLLAAACLLQLTQVIDVCSNFLIKQLHP 
SWCLGIRSPGDAQGCTELLNVAHKYTMEHFIBVXKNQEFXiLLPA 
NE I SKLLCSDD 1 NVPDEETI PHALMQWVGHDVQNRQGELGMIiLS 
YIRLPtiLP PQLLADIiETS SMFTGDLECQKLLMEAMKYHLLPERR 
SMMQSPRTKPRKSTVGAIjYAVGGMDAMKGTTTIEICYDLRTWSWL 
HIGTMNGRRLQPGVAVIDNKLYWGGRDGLKTLNTVECFNPVGK 
IWTVMPPMSTHRHGLGVATIiEGPMYAVGGHDGWSYLNTVERWDP 

egrqwnyvasmstprstvgwalnnklyaiggrdgssclksmey 
fdphtnkwslgapmskrrggvqvatyngplywgqhdapasnhc 

SRLSDCVERYDPKGDSWSTVAPLSVPRDAVAVCPIiGDKLYWGG 
YDGHTYLiriVBSYDAQRWEWKEEVPVNIGRAGACVVVVKIjP 


6922 


1075 


369 


GLPPAMQKVMYKGLAPEDKTLREIKVTSGAKIMGGGSTINDVLA 

vntpicdaaqqdakaeenkkeplcrqkqhrkvldkgkpedvmpsv 

KGAQERLPTVPLSGMYNKSGGKVRIiTFKLEQDQLWIGTKERTBK 
LPMGSIKNWSEPIEGKEDYHMMAFQLGPTEASYYWVYWVPTQY 
VDAI KDTVLGKWQYF 


6923 


2469 


1660 


lglfcilpidtiicaviterdtlsiresrlfgawrwaeaecqrqq 
lpvtfgnkqkvlgkalslirfplmtibefaagpaqsgilsdrev 

VNLFIiHFT VNPfCPRUE Y T D I? PPfTT.!? CVPr-r* two i?rtr\t roan urr^v 

sgtsdrirptvnrrisivgfglygsihgptdyqvniqiieyekk 
qtlgqndtgfscdgtantfrvmfkepibilpnvcytacatlkgp 
dshygtkglkkwhet paas KTVFFFFSSPGNKNGTSIEDGQ IP 
EIIPYT 


6924 


2210 


1235 


PEERVICFVEYYLTAFHEGRKGAIiAKKPYNPI igetfhcswevp " 
KDRVKPKRTASRSPASCHEHPMADDPSKSYKLRFVAEQVSHHPP 
ISCFYCECEEKRLCVNTHVWTKSKFMGMSVGVSMIGEGVIiRIjI.E 
HGEEYVPTLPS AYARS ILTI PWVELGGKVS INCAKTGYSATVI P 
HTKPFYGGKVHRVTAEVKHHPTNTIVCKAHGEWKGTLEFTYNNG 
ETKVIDTTTLPVYPKKIRPLEKQGPMESRNLWREVTRYLRLGDI 
DAATEQKRHLEEFGQRVEERKRENLRTPWKPKYFIQEGDGSGILQ 
S PLESTLMGLEVQSPPV 


6925 


2 


1553 


SCLQSMPSVRCLQISNGTSSVIVSRKRPSEGNYQKEBCDLCIKYF 
DQWSESDQVEFVBHLISRMGHYQHGHINSYIiKPMLQRDPITALP 
EQGLDHlAENILSYLDARSL<:AAELVCKSWQRVISEGMt.WKKLI 
ERMVRTDPLWKGLSERRGWDQYLFKNRPTDGPPNSFYRSLYPECI 
IQDIETIESNWRCGRHNLQRIQCRSENSKGVYCLQYDDEKIISG 
LRDNSIKIWDKTSLECLKVLTGHTGSVLCLQYDERVIVTGSSDS 
TVRVWDVl!^t3BVLNTLIHHNEAVLHLRFSNGL^4VTCSKDRS lAV 
WDMASATDITLRRVLVGHRAAVNWDFDDKYIVSASGDRTIKVW 
STSTCEFVRTLNGHKRGIACLQYRDRIiWSGSSDNTlRLWDIEC 
GACLRVLEGHEELVRCIRFDNKRIVSGAYDGKIKVWDIjQAALDP 
RAPASTl^CLRTIiVBHSQRVFRLQFDEPQIISSSHDDTILIWDFL 
NVPPSAQNETRS psrtytyi SR 


6926 


X 


733 


SGRVAMDGLGLQFPEQGFPAGPPLTiPPHMGGHYRDCQSLGAPPL 
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SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
secpience 


Amino acid segment containing signal peptide 

(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 

Glutamic Acid F=:Phi=Tivrl =il =n4T>c» n —r-T 4 

X/ ubC4iiui.%.* n^^vAf * ~ riieiiyxaxsnnie 5 w— ijiycme* 

H=Histidine, I-Isoleucine, K=Lysine, 

L=Leucine, M=Methionine, NsAsparagine , 

P= Proline, Q=Glutamine, RoArginine, 

S=:Serine, T^Threonine, V=Valine, 

W=Tryptophan, y=Tyrosine, X-Unknown, *^Stop 

Codon, /^possible nucleotide deletion, 

\=poa3ible nucleotide insertion) 








dgypijptpdtspiiDgvdpdpaffaapmpgdcpaagtysyaqvsd 

YAGPPEPPAGPMHPRLGPEPAGPSIPGLIiAPPSALHVYYGAMGS 
PGAGGGRGFQMQPQHQHQHQHQHHPPGPGQPTPPPEALPCRDGT 
DPSQPAELLGEVDRTEFEQYIiHFVCKPEMGLPYQGHDSGVNLPD 
SHGAI SS WS DASSAVY YCNY PDV 


6927 


2 


1484 


LTLCGDIQLMIiAQNASTOJRAAHLEEFHYQTKEDQEILHSIiHRESS 
CQGFAWATDLSTDLESQIiSVSCKCYEAANBILQFRDLKSQNPEH 
YVQVLKRMGNIRNEIGVFYMNOAAAIiO«;R"RT,V<:Trc;vc!D 1 T?riAT ot 

KKSFSCFEKGIHNFESIEDATNAALLLCNTGRLMRICAQAHCGA 
GDELKRBFSPEEGLYYNKAIDYYLKALRSLGTRDIHPAVWDSVN 
WELSTTYFTMATLQQDYAPLSRKAQEQIEKBVSEAMMKSLKYCD 
VDSVSARQPLCQYRAATIHHRLASMYHSCLRNOVGDEHLRKQHR 
VLADLHYSKAAKLPQLLKDAPCBLLRVQLERVAPASFQMTSQNS 
NVGKLKTLSGAIiDIMVRTEHAFQLIQKEIiIEEFGQPKSGDAAAA 
ADASPSLNREEVMKLLSIFSSRIjSFLLIiQSIKLLSSTKKKTSNN 
lEDDTlLKTNKHIYSQLLRATANKTATLLERINVIVHLLGQLAA 


5928 


1086 


777 


EAIDLINNLI.QVKMRKRySVDKTLSHPWLQDYQrWLDr.RELECK ' 

IGERYITHESDDLRWEKYAGEQGLQyPTtlLINPSASHSDTPETE 

ETEMKALGERVSIL 


'"'^$29 


1749 


607 


RDQRG YRDDRS PARE PGD VS ARTRSGGGGGRS ATTAM P PP VPNG 
NLHQHDPQDIjRHNGNVWAGRPSCSRGPRRAI QKPQPAGGRRSG 

rgpaagglclqppdggtcvpeeppvppmdwealekhlaglqpre 
qevrnqgqartnstsaqkneresirqklalgsffddgpgiytsc 
sksgkpslssrlqsgmnlq i cfvndsgsdkdsdaddsktets lid 

A ±'-i-»ft>i'MbRVfa&i> I oiJrtUTTEBESESLDDMDFIjTRQKKLQAEAKM 
ALAMAKPMAKMQVEVEKQNRKKSPVADLIiPHMPHISECLMKRSL 
KPTBLRDMTIQQLQVIVNDLHSQIESIiWEELVQIiLLIRDELHTE 
QDAMLVDIEDLTRHAESQQKHMAEKMPAK 


6930 


131 


545 


fkdtanvfvslfqmrnnfrhyfibpsqIiKlfydvitwivtqvai ■ 
sytwpfvllsikpsltfysswyyclhilgii^vllllpvkktqr 

RKNTHENI QLSQSKKFDEGEWSLGQNS FSTTNNVCNQNQE IASR 
HSSLKQ 


6931 


2 


659 


FVERLPNRPACLLVASGAAEGVSAQSFIiHCFTMASTAFNLQVAT 
PGGKAMEFVDVTE SNARWVODFRLKAYA<iPa.Tf T.F <5 T nnao vwti t 

LI PSCPGALTDLASSGSIAR I LQHFHSES KPICAVGHGVAALCC 
ATNEDRSWVFDSYSLTGPSVCEtiVRAPGFARLPLWEDFVKDSG 
ACFSASEPDAVHWLDRHLVTGQNASSTVPAVQNLLFLCGSRK 


6932 


2 


1131 


FVDSPGQGEQAEEEEGGIQMNSRMRAHS PAEGASVES SS PGPKK ' 
SDMCEGCRSIiAAGHPG YISHDKBTS I KYVSHQHPSH PQUFSIVR 
QACVRSLS CEVCPGREGPI FFGDEQHGFVFSHTFFI KDSLARGF 
QRWYSI I TIMMDRI YLINS WPFLXjGKVRG I IDELQGKALKVFEA 

eqfgcpqraqrmntaftpflhqrngnaarsltsltsddnlwacl 
htsfawllkaogsrltekllegaptedtlvqmekladleeeses 
wdnseaeeeekapvlpestegreltqgpaessslsgcgswqprk 
lpvfkslrhmrqvggrgtahher.rrranhglclptrlasgpstl 
ktlqbvtds llggwlmaqgvggi i 


6933 


1431 


890 


SLNLHCTLPPPPHQYPAGYPSDKEGKKPKGQSKKQPSGTTKRPI 
SDDDCPSASKVYKASDSASAIEAFQLTPQQQHLIREDCQNQKLW 
DEVLSHLVEGPNFLKKtiEQSFMCVCCQELVYQPVTTECPHNVCK 

DCLQRSFKAQVFSCPACRHDLGQNYIMIPNEILQTLLDLFPPGY 
SKGR 


6934 


3030 


^588 

— { 


DRDHSQCGGIRRVAIiARVSSVKLISKAKIRTVKMTPlIVIiAFIV 
CWTPFFFVQMWSVWDANAPKBASAFI I VMLIiASLNSCCNPWIYM 
LFTQHLFHELVQRFLCCSASYLKGRRLGETSASKKSNSSSFVLS 
HRSSSQRSCSQPSTA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A«Alanine, Cs^Cysteine, D=*Aspartic Acid, E= 
Glutamic Acid, Fs=Phenyl alanine, G=Glycine, 
H=Histidine, I^Isoleucine, K=Lysine, 
LaLeucine, M=Methionine, N-Asparagine, 
P=Proline, Q=Glutainlne, R=:Arginine, 
S= Serine, T=Threonine, V= Valine, 
WsTryptophan, Y«Tyrosine, X^^Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


693 5 


886 


543 


NSALYVAQGNDGTSCLNSVBRYSPECAGAWESVAPMNIRRSTHDli 
VAMDGWLYAVGGNDGSSSLNSIEKYNPRTNKWVAASCMFTRRSS 
VGVAVLELLNFPPPSSPTLSVSSTSL 


6936 


1347 


567 


RSHRRQFLSRALLEFFGKSHPPPHRLPRKSLNVGLHYSHIPFLT 
TCLHPLRKRLQKGEVGLSVETSKPQVPVGGLSRKKVPQEPWATV 
MEKRLQEAQLYKEEGNQRYRBGKYRDAVSRYHRALLQLRGLDPS 
LPSPLPNLGPQGPALTPEQENILHTTQTDCYNNLAACLLQMEPV 
NYERVREYSQKVtiERQPDNAKALYRAGVAFFHLQDYDQARHYLL 
AAVNRQPKDANVRRYLQLTQSELSSYHRKEKQLYLGMFG 


6937 


1 


727 


AVEFRCCPGRDPACFARGWRLDRVYGTCFCDQACRFTGDCCFDY 
DRACPARPCPVGEWS PWSGCADQCKPTTRVRRRSVQQE PQNGGA 
PCPPLEERAGCLEYSTPQGQDCGHTYVPAFITTSAFNKERTRQA 
TSPHWSTHT3DAGYCMEFKTESLTPHCALENRPLTRWMQYLREG 
YTVCVDCQPPAMNSVSLRCSGDGLDSDGWQTLHWQAIGNPRCQG 
TWKKVRRVDQCSCPAVHS FIFI 


6938 


3 


719 


NSRKLEIjAERVDTDFMQLKKRRQSSEKENDSGTLDTVGAWVDH^ 
EGNVAAAVSSGGIALKHPGRVGQAALYGCGCWAENTGAHNPYST 
AVSTSGCGEHLVRTILARECSHALQAEnAHQALLETMQNKFISS 
P FLASEDGVLGGVI VLRSCRCSAEP DSSQNKQTLLVEFLWSHTT 

esmcvgymsaqdgkakthisrlppgavagqsvaieggvcrlgep 
sbltlqaeceasqrhfrt 


6939 


3 


810 


kvtaprrpqryssghgsdnssvlsgelppamgrtalfhhsggss 
gyeslrrdseatgsassapdsmsesgaaspgartrslkspkkra 
tglqrrrlipaplpdttajlgrkpslpgqwvdlppplagslkepf 
eikvyeiddverlqrprptpreaptqglacvstrlrlaerrqqr 

LREVQAKHKHLCEEIiAETQGRLMLEPGRWLEQFEVDPELEPESA 
EYLAALERATAALEQCVNLCKAHVMMVTCFDISVZ^ASAAIPGPQ 
EVDV 


6940 


1188 


496 


GKMAAQPLRHRSRCATPPRGDFCGGTERAIDQASFTTSMEWDTQ 
WKGSSPLGPAGLGAEEPAAGPQLPSWLQPBRCAVFQCAQCHAV 

ia]>svhlawdlsrslgavvfsrvtnnwleapft.vgibgslkgs 
tynllfcgscgipvgfhlysthaalaalrghpclssdkmvcyll 
ktkaivkasemdiqnvplsekiaelxekivlthnrlkslmkils 
evtpdqskpen 


6941 


1 


713 


SLSRADSDPHGPHTCGHVLm?IIGSNVIiAIiAEAQRQAEAI.GYQA 
VVLSAAKQGDVKS^^QFYGLtAHVARTRLTPS^IAGASVEEnAQX» 
HEliAAELQ I PDLQLEEALETMAWGRGPVCLIAGGBPXVQLQGSG 
RGGRNQEXALRVGAELRRWPLGPI DVLFLSGGTDGQDGPTEAAG 
AWVTPELASQAAAEGLDIATFLAHNDSHTPFCCLQGGAHLLHTG 
MTGTNVMDTHLLFLRPR 


6942 


1 


246 


GDYVERYDPKTDTWTMGAPLSMPTNAVGGCLLGDRLYADGGYDG 
QTYLNTMESYDPQTNBWTQMASLNiaRAQACVWIKQP 


ft OA'S 


1 


739 


PMATGDGAKTIiAI HVKALTADS I R IT WKATLPASS FRLSWLRLG 
HS PAGGS ITETLVQGDKTEYLLTALE PKPT YI I CMVTMETTWAY 
VADETPVCAKAETADSYGPTTTLNQEQNAGPMASLPLAGIIGGA 
VALVFLFLVLGAl CWYVHQAGELLTRERAYNRGSRKKDDYMESG 
TKKDKS ILB IRGPGLQMLPINPYRAKEEYWHTI PPSKGSSLCK 
ATHTIG YGTTRGYRDGGI PDIDYS YT 


6944 


960 ' 


156 


VANILLNGVKYESBLTGSSERAEQPLSVGRI.CSTICNMPKALRT 
LCVNHFK3WLSFEGMLLFYTDFMGBWFQGDPKAPHTSEAY0KY 
NSGVTMGCWGMCI YAFSAAFYS A I LEKLEEFLS VRTLYFIAYIiA 
FGLGTGLATLSRNLYWLSLCITYGIIiFSTIiCTLPYSLLCDYYQ 
SKKFAGSSADGTRRGMGVDISUiSCQYFtiAQILVSLVLGPLTSA 
VGSANGVMYFSSLVSFLGCLYSSLFVIYEIPPSDAADEEHRPLL 
LS3fV 
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SEQ 
ID 
NO: 


Predicted 
beginniiig 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acad segment containing signal peptide 
{A=Alanine, C=Cysteine, Dispart ic Acid, E= 
Glutamic Acid, P=^ Phenyl alanine, G=.Glycine, 
H=Histidine, I=iIsoleucine, K=Lysine, 
L-Leucine, M=Methionine, N-Asparagine , 
P= Proline, Q==Glut amine, R=Arginine, 
S=serine, T=Threonine, VasValine, 
W^Tryptophan, Y==Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\«poesible nucleotide insertion) 


6945 


2067 


179 


EGEDRGLPRT^K3AALGTGTRLAPWPGRAC6AXiPRWTPTAPAQGC 
HSKPGPARPVPLKKRGYDVTRNPHLNKGMAFTLEERLQLGIHGb 
XPPCPLSQDVQLLRIMRYYBRQQSDLDKYI ILMTLODRWEKLFY 
RVLTSDVEKPMPIVYTPTVGIiACQHYGLTPRRPRGLPlTlHDKG 
HliATMLNSWPEDNIKAVWTDGERlLGLGDLGCyGMGIPVGKLA 
LYTACGGVNPQQCLPVLLDVGTNNEELLRDPLYIGLKHQRVHGK 
AYDDLLDEFMOAVTDKFGINCDIQFEDFAKANAFRLI^KYRNKY 
CMFNDDIQGTASVAVAGIIAALRITKNKLSNHVFGFQGAGEAAM 
G\lAHLLVMALB\KEGVPKA\EATRKrW\MVDF\KGDIVQGRDH 
riNHEKBMFAQD\HPEWSLEEWRI.VKPTAIIGVAAIAEA\FTE 
QILRDMASFHERP\iIFALSNPTSKASCTA\EKCyRVTEGPRGF 
FAS\6SPP*GVLIWEMGKTPIPGGRGNNA*RVPRGWQLGVHSPG 
GDPGHIP\DEIFLPDSRAKLPQEVSEQHLSQGRLYP\PLST\1R 
NVFLRIAIKVFD*GyKHNLV\SYYPEPKD\KEAFCKIPGSYTPD 
YDS PYT/ VDS y I WAQG KAMKVQTV 


6946 


133 


2551 


SCEYSGITVAPGDPCPGVAHLLAPSMASDTPESLMALCTDFCr/R 
NZiDGTLGYLLDKETIiRLHPDrFLPSElKCDRLVNEYVELVNAAC 
NF\EPHS\SFFNPr,FRDPRKQPASRRIHL\RED\LVQD\QDVLE 
AIRKQDL\VEL\YL1W\CBKLSAKSLQ1TJ?SFSHTLGVP*AFFG 
C\TNILLr,RKENPGGL/CEDEYLFNPTCQVLVKDFTFBGFSRLR 
P\LKLGRMIDWVPVES \LLRPLNSIAALDLSG IQTSDAA\ PLTQ 
'^voAjvjj \ii>ipiuuoiJUiiXK NVXVQDHKIjRHIjDISRDRLSS 

yykfkltrevlslfvqklgnlmsldisgXhmilehcsiskigkr 

EAGQTSI\EPSK\SSIIPPRGFEGGPLQF\liGVF*GIFCGRLTH 
IPAYKVSGDKNEEQVLNAIEAYTEHRPEirSRAINLLFDlARIE 

rcnqllralklvitalkchkydrniqvtgsaalfyltnseyrse 

QSVKLRRQVIQWLKTGMESYQEVTVORKCCLTLCNPSIPEELBF 

qyrrvnelllsilnptrqdesiqriavhlcnalvcqvdndhkba 

VGKMGPVVTMLiCLTQKKLLDKTCDQVMEFSW\SALWNITDBTPD 
NCEMFIiNPNGMKIiFLDCLNEFPBKQEIiHRjmiiGHjGNVAEVKEL 
RPQLMTSQFISVFSNLLESKADGIEVSYNACGVLSHIMFDGPEA 
WGVCEPQREEVEERMWATVIQSVroiNSRRNlNYRSFEPILRLLPQ 
GISPVSQHWATWAIiYNLVSVYPDKYCPLLIKEGGMPLLRDIIKM 
ATARQETKEMARKVrEHCSNFKEENMDTSR 


6947 


2 


16 B2 


TSVST I PRGIASARPQSRSWRCCPVWRRSPGRARGRGIiKMLWP 
SQSFPAPRSQQRVASGGRSKVPLKQGRSIiMDWIRLTKSGKDLTG 
LKGRLIEVTEEBLKKHNKKDDCWICIRGFVYNVSPyMEYHPGGE 
DELMRAAGSDGTBLFDQVHRWVWYESMIjKECLVGRMAIKPAVLK 
DYREEEKKVLKGMLPBCSQVTDTIAKEGPSYPSYDWFQTDSLVTI 
/EHlY*TEGYQFRLNNS*SSB*FLySRNNY*GLLISYTYW/R*A 
MRFRKIFIiCGL/CESVGKIEIVLQKKEHTSWDFLGHPLKNHNSL 
IPRKDTGLyYRKCQLlSKEDVTHDTRLFCLMLPPSTHLQVPIGQ 
HVYLKLPITGTEIVKPYTPVSGSLLSEFKEPVLPNWKYIYFLIK 
lYPTGLFTPELDRLQIGDFVSVSSPEGNFKlSKFQELEDLFLIiA 
Auiot i i^MVK.iiiKx AIjTDIPSLRKvKLMFFNKTEDDIIWRSQLE 
KtiAPKDKRIiDVEFVIiSAPISEWNGKQGHISPALLSSFljKRNXiDK 
SKVLVCICGPVPFTEQGVRLLHDLNFSKNEIHSFTA 


6948 


104 


58 


PDGAHSFFPDEYFTCSSLCLSCGVGCKKSMWHGKEGVPHEAKSR 

CRYSHQYDNRVYTCKACYERGBBVSWPKTSASTDSPWMGIiAKY 

AWSGYVIECPNCGWYRSRQYWPGNQDPVDTWRTEIVHVWPGT 

DGFLKDNNNAAQRLliDGMNFMAQSVSELSLGPTKAVTSWLTDQI 

APAYWRPNSQILSCWKCATSPKDNDTKHHCRACGEGFCDSCSSK 

rRPVPERGWGPAPVRVCDNCYEAR/TRPVSCYRGTSGR*RRRRT 
3ETVB 


6949 


152 


4656 I 


3LRLCLSRPLTRPGDDS VGGSAMASGAGGVGG6GGGKI Rf RRCH 
3GPIKPYQQGRQQHQGILSRVTESVKNIVPGWLQRYFNKNEDVC 
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SEQ 
ID 
NO: 



6950 



Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



Amino acid segment containing signal peptide 
{A-Alaaine, CoCysteine, D=A5partic Acid, E= 
Glutamic Acid, P-Phenyl alanine. G=Glycine 
H=Histidine, I^Isoleucine, K=Lysine 
li'Leucine, M=.Methionine, N=AsparagiAe, 
P=Proline, Q=Glutamine, R^Arginine, 
S=:Serine, T=Threonine, V=Valine 
W=Tryptophan, Y^Tyrosine, X»Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
Nt^posaible nucleotide inserti on) 
SCSTDTSEVPKWFBNKEDHL VyADEESSKlTDGRITPEPAVSNT ' 



*^*««viriv«2rii«iuii;iiijvxADEESSWITDGRITPEPAVSNT 

EEPSTTSTASTVYPDVLTRVSLYRSHLNFSMLESPALHCQPSTS 

SAFPIGSSGFSLVKEIKDSTSQHDDDWISTTSGFSSRASDKDIT 

VSKNTSLPPLWSPEAERSHSLSQHTATSSKKPAFtJLSAFGTLSP 

SLGNSSILKTSQUSDSPPyPGKTTYGGAAAAVRQSKLRNTPYQA 

PVRRQMKAKQLSAQSYGVTSSTARRILQSLBKMSSPLA0AKRIP 

S I VSSPLNS PLDRSGID ITDFQAKREKVDSQYPPVQRLMTPKPV 

SIATNRSVYFKPSLTPSGBFRKTNQRIDKKCSTGYEKNMTPGQEJ 

REQRESGFSYPNPSLPAANGLSSGVGGGGGKMRRERHAFVASKP 

LEEEEMEGPVLPKISLPITSSSLPTFNFSSPEITTSSPSPINSS 

QALTflKVQMTSPSSTGSPMFKPSSPIVKSTEANVLPPSSIGFTP 

SyPVAKTAELSGSSSTLEPIISSSAHHVTTVNSTNCKKTPPEDC 

EGPPRPAEILKEGSVLDILKSPGPASPKIDSVAAQPTATSPWY 

TRPArsSFSSSGIGFGBSLKAGSSWQCDTCLLQNKVTDNKCIAC 

GAAKLSPRDTAKQTGIETPNKSGKTTLSASGTGFGDKFKPVIGT 

MS^r^^^"^^^^^^^^'=^^^KPGTC\nCRALTLTA^^ 

MTASSSSCTVTTGTLGFGDKFKRPIGSWECSVCCVSNNAEDNKC 

VSCMSEKPGSSVPTSSSSTVPVSLPSGGSLGLEXPKKPEGIWDC 

ELCLVQNKADSTKCIACESAKPGTKSGFXGFDTSSSSSNSAASS 

SFKFGVSSSSSGPSQTLTSTGNPKPGDQGGFKIGVSSDSGYINP 

MSEGP*FSKHIVGFKFGVSSESKPEEVKKDSKNDNFKK3LSFGL 

SNPVFLTPFQFGVSNIXSQEEKKEELLKSSCAGFRFGTGVINSTR 

VPANTIVTSENKSSFNLGTIETKSVSVAPLKCQTSEAKKEEMPA 

vr^Mv^^'V^^''^^^^^^^^^^2=^QQSPVTSTSLVFGEG 

KLTMKSPKCXQPVFSFGEFQRQTKDENSSKSTFSFSMTKPSEKB 

SEQPAKATFAFGAQTNTTADQGAAKPDLSYLNNSSSSSSTPATS 

AGGG\IFGSSTSSSNPPVATFVFGQSSNPGSSS\AFGNTAESST 

SQSLLPSQDSiCLATTSSTGTAVTPPVFGPGASSNNTTTSGFGPG 

ATTTSSSAGSSFVFGTGPSAPSASPAPGAMQTPTPGQSQGASOP 

NPPGFGSISSSTALFPTGSQPAPPTFGTVSSSSQPPVFGOOPSO 

SAPGSOTTPNSSSAPOPGSSTTNFNFTNNSPSGVFTFGANSSTP 

AASAQPSGSGGPPFNQSPAAFTVGSNGKNVFSSSGTSFSGRKIK 
TAVRRRK 

FXFUSRSGLCRKAGERGAVRAG GLSRRTRAE* IMDELHYQDTDS 
DVPEORDSKCKVVWTHii!i?riT7nT ri7iTTmrtY:./-,«™T.,„«. ^ 



X940 



"239" 



. ..^wc,^i3u,uu«tcKi:,£;KUAVKAGGLSRRTRAE*iMDELHYQDTDS 

DVPEQRDSKCKVKWTHEEDEQLRALVRQFGQQDWKPLASHPPNR 

TDQQCQYRWLRVLNPDLVKGPWTKBEDQKVIELVKKYGTKQWTL 

lAKHLKGRLGKQCRBRWHNirLNPEVKKSCWTEEEDRIlCEAHKV 

LGNRWAEIAKMLPQRTDNAVKNHWNSTIKRKVOTGGFLSESKDG 

KPPVYLLLELEDKDGLQSAQPTEGQGSLLTNWPSVPPTIKEEEN 

SEEELAAATTSKEQBPIGTDLDAVR-rPBPLEEFPKREDQEGSPP 

ETSLPyKWWEAAWLLIPAVGSSLSEALDLlESDPDAWCDLSKF 

DLPEEPSAEDSINNSLVQLOASHQQQVLPPRQPSA\LVPSVTBY 

^i:,^P^^^^^^^^^^^^^^^SPSTEVGGSGlGTPPSVLKRQRK 

RRVALSPVTEirSTSLSFI,DSCMSr.TPKSTPVKTLPFSPSQF£NF 

WNKQDTLELESPSLTSTPVCSQKVWTTPLHRDKTPLHQKHAAF 

VTPDQKYSMDNTPHTPTPFKNALEKYGPLKPLPQTPHLEEDLKE 

!^^Sf?°^^^^^^°^^^^®KQ^KP'5^I^PIKKVRKSLALDIV 

DBDMKLMMSTLPKSLSLPTTAPSNSSSLTLSGrKEDNSLLWQGP 

LQAKPEKAAVAQKPRSHPTTPAPMSSAWKTVACGGTRDQLFMOE 
KARQLi:^RLKPaHTSRTLILS ^Si^trn^H 

A^PDUTMKRSLgAJ.YCQLLSF l,LIlALTBALAKAIOEPSPRESL ~ 

QVLPSGTPPGTMVTAPHSSTRHTSVVI4LTPNPDGPPSQAAAPMA 

TPTPRAEGHPPT\TPSPPSLRQ*PPPILKAP/SSrGPAPAAMAT 

TSSKPEGRPRGQAAPTILLTKPPGATSRPTTAPPRTTTRRPPRP 

PGSSRKGAGNSSRPVPPAPGGHSRSKEGQRGRKPSSTPLGQKRP 

LGKIFQIYKGWFTGSVEPEPSTLTPRTPLWGYSSSPQPQTVAAT 
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SEQ 
ID 
NO: 


Predicted " 

r^ucleotide 
location 
corresponding 
to first 

no acxo 
residue of 
amino acid 


Predicted end 
nucleoticie 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


amino acid segment containing signal peptide " 
CA^Alanxne, C=Cyeteine, D.Aspartic Acid, 
GLutamxc Acid, P=Phenyl alanine, G^GlyciAe, 
H=Histxdxne, I-Isoleucine , K-Lysine 
L=Leucine. M=Methionine, K=.AsparagiAe, 
P= Proline, Q=Glutamine, R^aArainin*^ 
S=serine, T=Threonine, Vr=Valine, 
W=Tryptophan, Y^Tyrosine, X^Unknown, *=stop 
codon, /=posBible nucleotide deletion 
__\=pos0ible nucleotide insertion! 


^952 






TV^i^NTSWAPliibbGPAKDKPGLRRAAUG'GGSTFTSQGGTPDA 
TAASQAPVSP/PSCPSAFSAPPPR*PTGWPQP**LLAYCYP\CT 
SRPLSTSSGVFTAATGPTPAAFDTSVSAPSQGIPQGASTTPQAP 
THPSRVSESTISGAKBETVA\PSP*PTGCPVLSPQWyPQP^^^ 

TPQQAC\RGAAPGBIPVPLKPLRrQLSEPRSPANGDYRDTGMVP 




1512 


304 


^:^nr''''''''"'"^^'^^^^^^^^^^^'^-^^'JAATPTPP5GSG\C5 
PTPRLVLLLHGPLRPSQLU?HCGE*EQSASPLQLDGKDASALWT 
ASRQARGELRLCLTTAVRGTSPSVSPVCOSS ^'^''^^^^^^^T 


6954 


819 


349 


wwcjO-RAIASGKHVPFGKQTOPNKS/VHCDS^G^^RRETTODES 

^»i'f«^^^'^^°^^^^^Q^^^^QJiTAAVVSAKSYM\CPVC 
GRALSSPGSLGRHriI»lHSEDORSWCAVfY^ft»PTciuarT,c«KTooT/T « 

BVLNMBSLPTVaNEGPSSAEGKDiAPSPPVYPAOILLVCMNCAA 
WmLEAQTPSVRKWAI.RIlQNEPr,EVRLQRLERBRTAKKSRRDM 

flLflAEMKIPFQI.PVSGVELDSOI,MKMapppAMOoer.,," ^ 


~6955 


19^8 


1 


*PQR*RMTEGNSPQRSAVATPGQGASPAAPRCTP*PSRHHRRLP 
PGARPPAG*AAPAPTKPWLAGPASAPQPOAAPLSPPAPPLIRTR 

♦CAGAAARGRPRRDRSPRPRTPGGCSWSEPRTPPAVSASAQTPS 

DAG * AGGR*G QRORPS TGR * P on vrr* anDotionw^mTT^^* 
nn^T^-.. *^w«.v;r«;j:'Oii3« «'*^'^vtjGAGRsHRREGTIPGNPHPR 

AS*RAGWaR*PGP/REWGI..EPQGBEMSGPGGPGGAPPNQVGSS 


"6SS6 


aso5 — 


782 


GBiroNEEHBYASKKRKIOJHPRNDTNTQSFYRBKWlYVHKESTK 
BRHGYCTLGBAFNRLDPSSAIQDIRRFHYWKLLQLIAKSQLTS 
LSGraQKNyFWII,DKIVOKVlJ,DHHOTRLIKDU,QDI,SSTLCID 
/N*ESREVCISGKHQYLDLPIRNy8RLATTATGSSDD*ASE\NG 
LTLSDLPLHMLIDIILYRPSDGWDIITLGQVTPTLYMI^EDROLW 

GOTLHFCRHCSILFWKDSGHPCTAADPDSCFTPVSPQHF^LPK 




1 


J839 

i 
1 
I 
C 

f 
I 
V 
A 
S 
K 


WXB-IBIFASPTSPPVl^EgVLQDNSPDLNNGgi5AlQEEM3fQ5s- 

DFPPSLTQPAPDQSSTIQLHPATSPAVSPTTSPAVSLWSPAAS 

Tco^^fr^^^^SP^V^S^SPASSAVLPAVSLBVPLTASV 

rf!^f'''"^^***^P^*S^«'KDVSSPI.ETTADVEEITOEG£T 

RSGSGDVMRRRIATPEEVRLPLQHGWRREVRIKKGSHRWQGETW 

yYGPCGKRMKQFPBVrKyLSRNWHSVRREHFSFSPRMPTODFF 

BERDrPEGLQWVQLSAEEIPSRIQAITGKRGRPRNTEKARTKEV 

PKVKRGRGRPPKVKITEIiLNIfTniTOPT r-ttt i?A/~»c.fT*nirr>i-.T^™ 

^KSKKKMRQKVQRGECQTTIQGQARNKRKQETKSLKQKEAKKKS 
CAEKBKGKTKQEKLKEKVKREKKEKVKMKEICEEVTKAKPA^^ 
CTLATQRRLEERQRQQMILEEMKKPTEDMCLTDHQPLPDPSRVP 
nAJ:^f??^^°^^^^^^^^^°^^°^PAKDVPSLGVLQEGI, 
.CQGDSLGEVQDLLVRLLKAALHDPGFPSYCQSLKILGEKVSEI 
LTRDNVSErLRCFLMAYGVEPALCDRLRTQPFOAQPPQQKAAV 
AFLVHELNGSTLI INEIDKTLESMSSYRKNKWI VEGRLRRLKT 
lAKRTGRSEVEMEGPEECLGRRRSSRIMEVTSGMEEEEEEBSI 
AVPGRRGRRDGEVDATASSIPELERQIEKLSKRQLFFRKKTiLH 
SQMLRAVSLGQDRYRRRywVLPYLAGrFVEGTEGNLVPBEVIK 
ETOSLKVAAHASLNPALFSMKMELAGSNTTASSPARARGRPRK 
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SEQ 
ID 
NO: 



predicted 
beginning 
nucleobide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



3514 



amino acid segment co ntaining aignal peptid e 
(A^Alanxne, C^Cysteine, D=Aspartic AcidT E= 
Glutamic Acid, P=Phenylalanine, G=Glycine 
H^Histidine, I=Isoleucine, K=Lysine 
L-Laucine, M=Methionine, N=Asparagi Ae , 
P= Proline, Q^Glutamine, R^Arginine, 
SaSerine, T=Threonine, V=Valine, 
W==Tryptophan, Y=.Tyrosine, X«Un3cnown, *=Stop 
Codon, /=poasible nucleotide deletion, 
\=pog9ible nucleotide ins ertion) 

LQLQSHKGFLEQEGSPLSLGQSQHDLSQSAPLSWLSQTQSHSSL 
LSSSVLTPDSSPGKLDPAPSQPPEEPBPDEAESSPDPQALWFNI 
SAQMPCNAAPTPPPAVSEDQPTPSPQQLASSKFMNRPSAANPCS 

FKQMEQRYLTQLTAQPVPPEMCSGWWWIRDPEMLDAMLKAIiHPR 
GIREKALHKHI.NKHRDPLQEVCLRPSADPIFEPRQLPAFQEGIM 
SWSPKEKTYETDIAVLQWVESLEQRVIMSDLQIRGWTCPSPDST 
REDLAYCEHLSDSQEDITWRGRGREGLAPQRKTTNPLDLAVMRL 
AALEQNVERRYLREPLWPTHBWLSKALLSTPNGAPEGTTTEIS 
YEITPRIRVWRQTLERCRSAAQVCLCLGQLERSIAWEKSVNKVT 
CLVCRKGDNDEFLLLCDGCDRGCHIYCHRPKMSAVPEGDWFCTV 
CLAQQVEGEPTQKPGFPKRGQKRKSGYSLNFSEGDGRRRRVLLR 
GRESPAAGPRYSEEGLSPSKRRRLSMRKHHSDLTPCEIILMEME 
f^^^fJ^P^^^^^^^^^^^^MDFSTMRERLLRGGYTS 
SEEFAADALLVFDNCQTFNEDDSSVGKAGHIMRRFFEXSRWEEF 
^Q^^Q^QSVRQGRWGVTLWHLPPTFQTKTCHFHLLMLPWVQTQV 



"6958" 



274 



1663 



6959" 



Ki.xvA.^EPTKKEENEVPAPA PJ;>PEEP5KBKEAGTTPAKDWTL V 
ETPPGEEQAKQNANSQLSILFlEKPQGGTViCVGEDITPIAKVKA 
BDLSEKPTINGSRKWMDIiASKAGKHLQLKETFERHSRAnfTFEMO 
IIKAKDNPAGNYRCEVTYKDKFDSCSFDLEVHESTGTTPNIDIR 
SAFKRSGEGQEDAGELDFSGIiLKRRSVKQQEEEPOVDVWELLKN 
TKPSEYEKIAFQYESPTCSGMLKRLKRSIREEKKSAAPAKILDP 
VYQVDKGGRVRFWELADPKLEVKWNKNGQELRPSTKYIFEDTR 
CQS I LWIDNCQMTDDSEYYVTAGDEKCSTELLVREPPIMVTKOL 
EDTTDYCGERVELECEVSEDDAQVKWPKNGEEIILVQTRYRIRV 
EGKKHILIIEGATKADAADYSVMTTGGQSSAKLSVDLKPLKILT 
PLTDQTVNLGKErCLKCEISENIPGKWTKNGLPVQESDRLKWH 
KGRIHKLVIDHALTEDEGDYVFAPDAYNVTLPAKVHVIDPPKII 
LDGLDADNTVTVrAGNKLRLElPISGEPPPKftMWSRGDKAIMEG 
SGRIRTESYPDSSTLVIDIAERDDSGVYHINLKNEAGEAHASIK 
VKWDFPDPPVAPTVTEVGDDWCIMNWEPPAYDGGSPILGYFIE 
RKKKQSSRWMRLNFDLCKETTPEPKKMIEGVAYEVRIPAVNAM 
GISKPSMPSRPPVPIAVTSPPTLLTVDSVTDTTVTMRWRPPDHI 
GAAGLDGYVLEYCPEGSTSAKQSDENGEAAYDLPAEDWIVANKD 
LIDKTKPTITGLPTDAKIFVRVKAVNAAGASEPKYYSQPILVKE 
IIEPPKIHSPKHLKQTYIRRVGDRVILVIPFQGKPRPELTWKKD 
GAEIDKNQINIRNSBTDTIIFIRKAERSHSGKYDLQVKVDKFVE 
TASIDIRIIDRPGPPQIVKIEDWGRNVALTWTPPKDDGNAAIT 
GYTIQKADKKSMEWLRVIEHIIBPVPHTELVIGNEYYPRVFSEN 
MCGLSEDATMTKESAVIARDGKIYKNPVYEDPDPSEAPMFTQPL 
VNRLCHSGYMATLNCSVRGNPKPKITMMKNKVAIVDDPRYRMFS 
NQGVCTLEIRKPSPYDGGTYrr(fAVNnT^wt?Tr.r,irT.«rT«.^,^ 




-.X* vi^A^yioyvj&OAl^rsVKVDIEKEVTCPICIiEliLTEPLSL" 
DCGHSFCQACITAKIKESVIISRGESSCPV0QTRFQP6NLRPNR 
HIANIVERVKEVKMSPQEGQKRDVCEHHGKKLQIFCKEDGKVIC 
WVCELSQEHQGHQTFRINEWKECQEKLQVALQRLIKEMQEAEK 
LEDDIRQERTAWKNYIQIERQKILKGPNEMRVILDNEEQRELQK 
LEEGEVNVIJDNLAAATDQLVQQRQDASTLISDLQRRLRGSS 
LQDVIDVMKRSESWTLKKPKSVSKKLKSVFRVPDLSGMLQVLKE 
LTDVQYYWVDVMLNPGSATSNVAISVDQRQVKTVETCTFKNSNP 
CDFSAFGVFGCQYFSSGKYYWEVDVSGKIAWILGVHSKISSLPTK 
RKSSGFAPPPSVWYSKVYSRYRPQYGYWVIGLQNTCEYNAPEDS 
SS5DPKVIiTLFMAV\LPVVLG FS 

ai.VHW£FGRGiEDFPYi:,FFQ LTHCC30RICSVTQAGVQWCI>HSS ~ 
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ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

c orr e spondi, ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid aegmenc containing signal peptide 
(A=Alanine, C^Cysteine, D=Aspartic Acid, 
Glutamic Acid, F« Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glut amine, R=sArginine, 
S=serine, r=Threonine, VoValine, 
W=Tryptophan, Y^Tyrosine, X=Unlaiown, *=stop 
wv^vii^i^, /Hpt^asADxe nucxeocxde deletion, 
\-possible nucleotide insertion) 


5960 






LQPQTPGLwg5SHI.SI*LSSRDYRMLSSFKEWFWQDRFWLPPNVT 
WTBIiEDRDGRVYPHPQDU^AALPIJU^VLIAMRLAFERFrGLPLS 
RWLGVRDQTRRQVKPNATLEKHFLTBGHRPICEPQLSI/LAAQCGL 
TLQQTQRWFRRRRNQDRPQLTKKFCSASWRPLPYLSSFVGGLSV 
LYHESWIiWAPVMCWDRYPNQLTLSCPAADSEA\SLYWWYLLELG 

fylsllirlpfdvkrkgggpssikprphydppstaNdfkeqvih 

HFVAVlLMTFSYSANLLRIGSLVLLIiHDSSDYLLKACKMVNYMQ 

yqqvcdalflipspvfpytrlvlfptoibyttyyesisnrgpff 
gyypfwgiliimlijqi,lhvfwsclilrmr,ysfmkkgq^3ekdirsdv 
eesdsseeaaaaqeplqlkngtaggprpaptdgprsrvagrltn 

RHTTAT 




387 


2068 


JtAw-HiosjsjiiyiUiti? \TKbif'F\RGRPDLSTLTHSIVRRRYLAHSGRS 

hlepeekqalkrlvbebplkmqvdeaasrb2?kl,dltkkqkrppt 
pcsdperkrfrfnsesesgseasspdypgppakngvasrshthp 
keekprra\skaveessdebrqrdlpaqrgeesseeeekgykgk 
trkkpwkkqapgkasvsrkqareeseeseaepvqrtakkvegn 

KGrKSrjCBSBQESEEEIIiAQKKEQREfiEVEBEEKEErEEKGDWK* 

prtrsngrrksareersckqksqakrllgdsdseeeqkeaassg 

DDSGRDREPPVQRKSEDRTQLKGGKRLSGSSEDEEDSGKGEPTA 

kgsrkmarlgstsgeesdlbrevsdseagggpqgerkijrsskks 

SRKGRrasSSSSSDGSPEAKGGKAGSGRRGEDHPAVMRLKRYIR 
ACGAHRNYKKLLGSCCSHKERLSILRAELEAIiGMKGTPSLGKCR 
ALKEQREEAAEVASLDVAWI ISGSGRPRRRTAWNPLGEAAPPGB 

lyrrtldsdeerprpappdwshmrgiissdgesn 


6961 
6952 


340 


1646 


RPWSSPTMKPlMFSUiLRIPNUJCWGIPYLSKHRADRiSlRRLGDFL 

nqespdialleevwseqdpqylrqklsptypaahhfrsgiigsg 

LCVPSKHPIQELTQHIYTLNGYPYMIHHGDWFSGKAVGLLVLHL 
SGPfVLNAYVTHLHAEYNRQKDIYLAHRVAQAWErAQFIHHTSKK 

adwllcgdlnmhpedlgccllkewtglhdayletrdfkgseeg 

NTMVPKNCYVSQQELKPFPFGVRIDYVLYKAVSGFYISCKSFET 
TTGPDPHRGTPLSDHEAIiMATLFVRHSPPQQNPSSTHGPVaERS 
PLyMCVCLKEAIJDGSL6LGMA\QARWWA\TFA\SYVIGLGL\IiL 
LALLCVXiAAGGGAGEAAILLWTPSVGLVLWAGAFYLFHVQEVNG 
LYRAQ7VELQHVLGRAREAQDLGPEPQLYAZiL\LGQQEGDRTKEO 


6963 


346 


1646 


rpwssptmkpwfslrlrifkdncwgipylskhradrmrrlgdfl 

NQESPDIiALLEBVWSEQDPQYLRQKLSPTYPAAHHFRSGIIGSG 
LCVFSKHPIQELTQHrYTLKTGYPYMlHHGDWFSGKAVGLLVLHL 
SGMVItNAYVTHl^HAEYKRQKDIYLAHRVAQAWELAQFIHHTSKK 
ADWLLOSDLNMHPEDLGCCIiIjKSWTGLHDAYLETRJDFKGSEEG 

ntmvpkncyvsqqelkpfpfgvridyviiykavsgfyiscksfet 
ttgfdphrgtplsdhealmatlfvrhsppqqnpssthgp\aers 

PL/MCVCLKEALDGSJjGLGMA\QARWWA\TFA\SYVIGLGL\LL 
LALLCVLAAGGGAGEAAILIiWTPSVGlUVLWAGAFYIiFHVQEVNG 
LYRAQAELQHVI,GRAREAQ0U3PEPQLYALL\LGQQEGDRTKEO 






2618 

] 
< 


RVTPLri.KLLKKPKTAENQKASEENEITQPGGSSAKPGLPCLNF 
EAVLSPDPALIHSTHSLTNSHAHTGSSDCDISCKGMTERIHSIN 
I^NFSNSVLETLNEQRNRGHFCDVTVRIHGSMLRAQRCVLAAGS 
PFFQDKLLLGYSDIEIPSWSVQSVQKLIDFMYSGVLRVSQSEA 
LQIIiTAASILQIKTVIDECTRIVSQNVGDVFPGIQDSGQDTPRG 
TPESGTSGQSSDTESGYLQSHPQHSVDRIYSALYACSMQNGSGE 
RSFYSGAWSHHETALGLPRDHHMEDPSfflTRIHERSQQMBRYL 

sttpetthcrkqprpvriqtlvgnihikqemeddydyygqqrvq 

ILERNESEECTEDtDQAEGTESEPKGESFDSGVSSSIGTEPDSV 

eqqfgpgaardsqabptqpeqaaeapaeggpqtnqletgasspe 
^snevemdstvitvsnssdksvlqqpsvntsigqplpstqlylr 

SrETLTSNLRMPLTLTSNTQVIGTAGNTYLPALFTTQPAGSGPK 
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SEQ 
ID 
NO: 


ieredicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amano acid segment containing signal peptidS"* 
(A^Alanine, C^Cyeteine, D^Aspartic Acid, B= 

Glutamic Acid. P=Ph.envrlal ai-*-; r•^^^^..^i 

*^**^**y-La4.anine, Gs^Glycine, 
HaHistidine, I=Isoleucine, K=Lysine, 
L=l*eucine^ M=Methionine, W=Asparagine 
P=Proline, Q=Glutamins, RoArginine, 
S=Serine, T«Threonine, V^Valine, 
WoTryptophan, Y=Tyrosine, X=.Unknown, *t=stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide in^^vHi^n^ 


6964 






Pi'XFSI.PQPIAGQQTQPVTVSQPGJbSTFTAQLPAPQPLASSAGH 
STASGQGEKKPyECTLCNKTFTAKQNYVKHMFVHTGEKPHQCSI 
CWRSPSLKDYLIKVHMVTHTGVRAYQCSICDnCRPTQKSSLNVHM 
RLHRGEKSYECYICFCKKFSHKTLLERHVAIiHSASNGTPPAGTPP 
_ C3ftRAGPeGWACTEGTTYVCSVCPAKFJDQIEQFNDHMRMHVSDQ 


6965 


1 


178 • 


SGRPFFFFFSNTDVYPIKKVT'NRWTAGSSYKMTRMKSIGKILLL 
QIFIG\NCSMFVLVI 


6966 


757 


208 


NVFIEPRIQGFMKTSAHPGQKHPDPSMGLLFPLLAAI.EVCSCGS"" 
SGSLGYKrLPQNHXGLLGRWTLVLLGQMRRISPFLCLKDRSDFRP 
PQEKVEVSQLQKA\QAMSFLYDVr.QQVFNPSHKAriL\CCMEHDL 
PGPTPHFTSSAAGTPGDLLGAGDGRRRSWGQWVIEGSTLALRRY 


6 967 


820 ■ ■ 


1867 


IITALGVRGMPGCPCPGCGMAGPRLLFLTALALELLGRAGGSQP 
ALRSRGTATACRLDKKESESWGAliLSGERLDTWICSLLGSLMVG 
LSGVFPLLVIPLEMGTMLRSEAGAVfRI.KQLLSFALGGI>LGNVFI. 
HLLPEAWAYTCSASPGGEGQSLQQQQQLGLWVIAGILTFIALEK 
/HVPGQQGGGDQPGPQQRPHCCCRRAQWRPLSGPAGCRARPRCR 
GPXDIKVSGYLNLLAlflTIDNFTHGLAVAASFLVSKKIGLLTTMA 
ILLilElPHEVGDFAILLRAGFDRWSAAKLQLSTALGGLLGAGFA 
ICTQSPKGVEETAAWVLPPrSGGFLYIALVNVLPDLLEBEDPW 


6968 


162 


633 


GFIiPFKYWIIiDLSASSRMETDCNPMEIiSSMSGFEEGSELNGFEG 
TDMKDMkLEAEAWNDVL PAVNNMFVSKSLRCADDVAY INVBTK 
ERNRYCLELTEAGLKWGYAFDQVDDHLQTPYHETVYSLLDTLX 
SPAYREAFGKRNLLQRLEALKRDGQS 


6969 


1 


2265 


RGGGGGRGGPGARERERPGEPERTMEAAAGGRGCFQPHPGLQKT 

LEQFHI,SSMSSLQQPAAF5ARWAQEAYKKESAKEAGAAAVPAPV 

PAATEPPPVLHLPAIQPPPPVLPGPFFMPSDRSTERCETVLEGE 

TISCFWGGEKRIiCIjPQIIiNSVLRDFSLQQINAVCDELHiycSR 

CTADQLEILKVMGHiPFSAPSCGLITKTDAERLCNArJjYGGAYP 

i:'i'CitKKL4AASIAIX3LELSERSVRVYHE\CFGKCKGL\LVPELYS 

SPSAAClQCLD\CRr,MYPPHKFVVHSHKALBNRTCHWGF\DSA\ 

NWRAYILLSQDYTGKEEQARLGR\CLDDVKEKF£>YGNKYKRRVP 

RVSSEPPASIRPKTDDTSSQSPAPSEKDKPSSWLRTLAGSSNKS 

LGCVHPRQRLSAFRPWSPAVSASEKELSPHLPALIRDSFYSYKS 

FETAVAPNVALAPPAQQKWSSPPCAAAVSRAPEPLATCTQPRK 

RKLTVDTPGAPBTLAPVAAPEEDKDSEAEVBVESREEFTSSLSS 

LSSPSPTSSSSAKDLGSPGARALPSAVPDAAAPADAPSGLEAEL 

EHLRQALEGGLDTKEAKEKFLHEWKMRVKQEEKLSAALQAKRS 

LHQELEFLRVAKKEKLREATEAKRKLRKEIERUIAENEKKMKEA 

NESRLRLKRBLEQARQARVCDKGCEAGRLRAKYSAQIEDLQVKL 

QHAEADREQLRADLLREREAREHLEK\WK\ELQEQLWPRARPE 
AAGSEG\AAELEP 




1855 


118 

] 
1 
] 
( 
I 


AGT^aHGHLKVKTSEEQAEAKRLEREQKLKLyQSATQAVFQKRQA 
GELDESVLELTSQIU5ANPDFATLWNCRREVLQQLETQKSPEEL 
AALVKAELGPLESCLRVNPKSYGTWHHRCWLLGRLPEPNWTREL 
ELCARPLBVDERNPHCWDYRRFVATQAAVPPAEEIAFTDSLITR 
NPSNYSSWHYRSCLLPOLHPQPDSGPQGRLPEDVLLKELELVQN 
AFFTDPNDQSAWPYHRWLLGRADPQDALRCXiHVSRDEACLTVSP 
SRPLLVGSRMEIbLLMVDDSPLIVEWRTPDGRNRPSHVWLCDLP 
AASLNDQLPQHTFRVIWTAGDVQKECVLLKGRQEGWCRDSTTDB 
DLPRCELSVEKSTVLQSELESCKEI>QSLEPEWKWCL\LTIILLM 
RALDPLLYEKETLQYFQTLK\AWDPKRATY\LDDLRSKPLI.EWS 
l^KMEYAEVRVLHLAHKDLTVLCHLEQLLLVTHLDLSHNRLRTL 
PPALAALRCIiEDPPPRTXVLQASDNAIESLDGVTNLPRLQEUiL 

DNNRLQQPAVLQPLASCPRIiVLLNLQGNPLCQAVGILEQLAELL 
5SVS SVLT 
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SEQ 
ID 
NO: 


predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
araino acid 
sec[uence 


Ammo acid segment containing signal peptide " 
(A=Alanine, C=Cysteine, D^^Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I-Isoleucine* K=Iiysine, 
L=Leucine, M^Methionine, N=Asparagine , 
P«Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V^Valine, 
W-Tryptophan, Y^Tyroaine, X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


6970 


3 


1528 


SFPPIiLSSPSAVGEGKVAVAAPCPGRSECARAKMAyiQLEmTE 
GFLSRISGLLIjCRWTCRHCCQKCYESSCCQSSEDBVEILGPFPA 
QTPPWLMASRSSDiCDGDSVHTASEVPLTPRTNSPDGRRSSSDTS 
KSTYSLTRRISSLESRRPSSPLIDIKPIEFGVLSAKKEPIQPSV 
LRRTYNPDDYFRKFEPHLYSLDSITSDDVDStirDBEILSKYQLGM 
LHFSTQYDLLHNHLTVRVIBARDLPPPISHDGSRQDMAHSNPYV 
KICLLPDQKNSKQTGVKRKTQKPVFBERYTFEIPFLEAQRRTLL 
LTVVDFDKFSRHCVIGKVSVPLCEVIJriVKGGHWWKALIPSSCfNE 
VELGELLLSLWYLPSAGRLNVDVIRAKQLLQTDVSQGSDPPVKI 
QLVHGLKLVKTKKTSPLRGTIDPFYNESPSPKVPQEELEWASLV 
FTVFGHN^4KSSNDPIGRIVIG\QysSGP\SEPNHWRRMLNTHRT 
AVEQWHSLRSRAECDRVSPASIiEVT 


6971 


37 


3702 


ACFWPGSRSFKLIPRHGLVNMGRSGKLPSGV'SAKI.KRWfaCGHS 

SDSNPAICRHRQAARSRFFSRPSGRSDriTVDAVKIiHNELQSGSL 

RLGKSEAPETPMEEEAELVLTEKSSGTFLSGLSDCTNVTPSICVQ 

RFWESNSAAHKEICAVLAAVTEVIRSQGGKETETEYFAALIRKA 

AQHGVCSVLKGSBFMFEKAPAHHPAAISTAKFCIQEIEKSGGSK 

EATTTLHMLTLLKDUiPCPPEGLVKSCSETLLRVMTLSHVLVTA 

CAMQAFHSLFHARPGLSTLSAELNAQIITALYDYVPSEN0LQPL 

LAWLKVMEKAHINLVRLQWDLGLGHLPRFFGTAVTCXLSPHSQV 

LTAATQSIjKEILKECVAPHMADIGSVTSSASGPAQSVAKMFRAV 

EEGLTYKFHAAWSSVLQLLCVFFEACGRQAHPVMRKCLQSLCDL 

RLSPHFPHTAALDQAVGAAVTSMGPEWI.OAVPLBIDGSEETLD 

FPRSWLLPVIRDHVQETRLGFFTTYFLPLANTLKSKAMDUVQAG 

STVESKIYDTLQWQMWTLLPGFCTRPTDVAISFKGLARTLGMAI 

SERPDLRVTVCQALRTLITKGCQAEADRABVSRFAKNFLP ILFN 

LYGQPVAAGDTPAPRRAVLETIRTYLTITDTQLVNSLLBKASBK 

VLDPASSDPTRLSVX^DLWALAPCADEAAISKLYSTXRPYLESK 

AHGVQKKAYRVLEEVCASPQGPGALPVQSHLEDLKKTLLDSU^S 

rSSPAKRPRIiKCLLHIVRKLSAEHKEFrTALIPEVXLCTKEVSV 

GARKNAFALLVEMGHAFLRFCSNQEEAIiQCYLVLIYPGLVGAVT 

PWSCS IIALTHLIiFEFKGLMGTSTVEQLIiENVCLLIiASRTRDW 

KSAIiGFI KVAVTVMDVAHLAKHVQLVMEAIGKLSDDMRRHFRMK 

LRNLFT\KPIPK\FGILTWGKKAVGPKBYHRVLVNIRKAEARAK 

RHRALSQAAVEEEEEEEEEEEPAQGKGDSIEEILADSEDEEDNE 

EEBRSRGKEQRKLARQRSRAWLKEGGGDEPUJIPLDPKVAQRVIA 

TQPGPGRGRKKDHSFKVSADGRLIIREEADGNKMEESEGAKGED 

aairim-rir'WjiUVl XRNKKHQKLKHQKETiEEEBLEI PPQYQAGGSGI 

HRPVAKKAMPGAEYKAKKAKGDVKKKGRPDPYAYIPIiNRSKLNR 

RKKMKLQG0FKG1*VKAAQR6SQVGHKNRRKDRRP 


6972 
6973 


2179 


973 


PGGAILLPLWRRTRPREATVPRGAAQRGRARSAEGRIPSSQSPS 
PAEAGGATRSPPPRPPRPARPPGPSAPPLLRSDAGPGATVSAAA 
AAATERARRGATMGAQLSTLGH^5VLPPVWFLYSLLMKLPQRSTP 
hlThES PDIKYPLRLIDREI ISHDTRRFRFALPSPQHILGLPVG 
QHIYLSARIDGNIiWRPYTPISSDDDKGFVDLVIKVYFKDTHPK 
FPAGGKMSQYLBSMQIGDTIEFRGPSGLIiVYQGKGKFAIRPDKK 
SNPIlRTVKSVGMIAGGTGITPMIiQVIRAIMKDPDDKTVCHLLF 
ANQTEXDILIiRPELEELRNKHSARFKLWYTLDRAPBAWDyGOGX 

FVNEEMIRDHLPPPE\EEPLVLMCGPPPMIQYACLPNL\DHVGH 
PTERCFVF 




1 


1964 

] 
] 


IiQPRCAHRGLRAQKCGRPAPGVnAMVLCPVIGKLLHKRVVLASA " " 

SPRRQBHiSWAGLRFEWPSKFKEKLDKASFATPyGYAMBTAKQ 

KALEVANRLYQKDLRAPDWIGADTIVTVGGLILEKPVDKQDAY 

aMLSRPE/SGREHSVFTGVAlVHCSSKDHQLDTRVSEFYEETKV 

^SELSEELLWEYVHSGEPMDKAGGYGIQALGGMLVESVHGDFL 

WGFPLNHPCKQLVKLYYPPRPEDLRRSVKHDSIPAADTPBDL 



580 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted en3" 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
seejuence 


Amino acid segment containing signal peptide 
(A»Alanine, C=Cysteine, D=Aspartic Acid, E« - 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 
H=Hietidine, I=Xsoleucine, K=Lysine, 
L=Leucine, M=Methionine, N*Asparagine , 
P^Proline, Q=Glutamine, R=Arginine, 
S=Setine, Ts= Threonine Vs=Val i-na 
W=Tryptophan, Y^Tyrosine, X-Unknown, *=.Stop 
Codon, /^possible nucleotide deletion, 
\=po6sible nucleotide insertion) 


6974 






SDVEGGGSiit-raRDAGSRDEKAEAGEAGQATAEAECHRTRETLP 
PFPTRtiLELIEGFMLSKGLLTACKLKVFDLLKDEAPQKAADIAS 
KVDASACGMERLLDICAAMGLLEKTEQGYSWTETAMVYLASDGE 
* -^"^^^^^-iji WNLFTYLEFAIREGTNQHHRALGKKAEDLF 
QDAYYQSPETRLRFMRAMHGMTKLTACQVATAFNLSRPSSACDV 
GGCTGALAREIAREYPRMQVTVFDLPDIIEIAAHFQPPGPQAVQ 
IHFAAGDFFRDPLPSAELYVLCRILHDWPDDKVHKLLSRVAESC 
j^r(j/v\,ijiji,v t f LLDEE KRVAQRALMQSLNMLVQTEGKERS tiGEY 
QCLLELHGFHOVQWHLGGVLDAIL\PPKWPPRAnJianQT. 


6975 


3082 


2172 


K^CAAFASFASRPPLELPAPPGSHRSPPGRGVatSAQCALSVRK ' 

LLAARPGLGTKYQATMVYKTLFALCILTAGWRVQSLPTSAPLSV 

SLPTJsriVPPTTIWTSSPQNTDADTASPSNGTHNNSVLPVTASAP 

TSLLPKNISIESREEEITSPQSNWEGTNTDPSPSGFSSTSGGVK 

ui i ^J^i^EMSLGTPEAGVAATLSQSAAEPPTLISPOAPASSPSSL 

STSPPEVFSASVTTNHSSTVTSTQPTGAPTAPESPTEESSSDHT 

PTSHATAEPVPQEKTPPTTVSGKVMCELIDMET\PPPPl>f3 


6976 


2 


500 


KPRPTVHCCKWALKLETi^MK-JLIJJVFHAHSGKBGDKYKLSKKgL 
KELLQTELflGFLDVKELML*ATEALKTPEEA*KSPIIQCSSSRS 
SLPPAPQPPPyr,*LSAVPFPlHLPLPLLPPQAQKDVDAVDKVMK 
i5jji/iinijutfiiViJi«OEyvviiVAALTVACNNFFWE2^S 


6977 


1215 


970 


GCQL^VAH^I-TENSPVTFAHPPEDTVEQKftESVGRIMPHTEARl"- 
1 ii.iiiurtoxi4*ijMjiM 1 t'ijt,ijCIRGYCVMLGYWGEPQK?rEBAVDQDKW 
YWTGDVATMNEQGPCKIVGHSKDMIIRGGENIYPAELEDFFHTH 
PKVQEVQWGVKDDRMGEEICACIRLKDGEETTVEEIKAPCKGK 

ISHPKIPKYIVFVTNYPIiTISGKIQKPKLREQMERHLNL*rKQO 
ACPGRlA 


6978 


1298 


588 


ai^FINTNLLSNWii^KTSJbXjMCSEPISDNTKDCjKGKLKTPDFA^R" 

ANKKSKHHVNGNRTVEPFPEGTQMAVFGMGCFWGAERKFWVLKG 

VYSTQVGFAGGYTSNPTYKEVCSSKTGHAEWRWyQPEHMSFE 

ELLKVFWENHDPTQGMRQGNDHGTQYRSAIYPTSAKQMEAALSS 

KENYQKVLSEHGPGPITTDIREGQTFYYAEDYHQQYLSKNPNGY 
CGLGGTGVS CP VGIKK 


6979 


3 


242 


airi^KRDSRRCGCCKGSSLRHTAVAMVKLSKEAKQRIKSQTFKG^ 
FAIRMGFIPIiVIYLGFKRGADPGMPEPTVLSLLWG 


6980 


3917 


1146 

] 
C 
I 
I 

c 

" 420 


DiARVRGSAVAAAXI/SRCRHWSGPPPPPPSPPDRKGLRGTEPWE 

AGPGSGATPGARAMDVRRLKVNELRBELQRRGLDTRGLKTELAE 

RLQAALEAEEPDDERELDADDEPGRPGHINEEVETEGGSELEGT 

AQPPPPGLQPHAEPGGYSGPDGHYAMDNITRQNQFyDTQVIKQE 

NESGYERRPLEMEQQQAYRPEMKTBMKQGAPTSFLPPEASQLKP 

DRQQFQSRKRPYEENRGRGYFEHRBDRRGRSPQPPAEEDEDDFD 

DTLVAlDTYNCDLHFKVARDRSSGYPLriBGPAYLWSGARASYG 

VRRGRVCFEMKINEEISVKHLPSTEPDPHWRIGWSLDSCSTQr, 

GEEPFSYGYGGTGICKSTNSRFENYGDKFAENDVIGCFADFECGN 

DVELS FTKNGKPfMGIAFRXQKEALGGQALYPHVLVKNCAVBFWF 

GQRAEPYCSVLPGFTFIQHLPLSERIROTVGPKSKAECBILMMV 

GLPAAGKTTWAIKHAASNPSKKYNILGTNAIMDKMRVMGLRRQR 

NYAGRWDVLIQQATQCLNRLIQIAARKKRNYILDQTWVYGSAQR 

RKMRPFE6FQRKAIVICPTDBDLKDRTIKRTDEEGKDVPDHAVL 

3MKANFTLPDVGDFLDEVLFIELQREEADKLVRQYNEEGRKAGP 

^PEKRFDNRGGGGFRGRGGGGGFQRYENRGPPGGWRGGFQMRGG 

SSGGGGNYRGGFNRSGGGGYSQNRWGNNNRDNNNSNNRGSYNRA 

'QQQPPPQQPPPPQPPPQQPPPPPSYSPARNPPGASTYNKNSNI 

'GSSANTSTPTVSSYSPPQSFGFFPSTFQPSYSQPPYNQGGYSQ 

lYTAPPPPPPPPPAYNYGSYGQYNPAPYTPPPPPTAQTYPQPSY 

fQYQQYAQQWNQYYQNQQQWPPYYGNYDYGSYSGNTOGflTfiTn 

fTRGRKTGRVAAPiJTRRRTGNMOKDQTRSPAMSLSDPGLGYHPT 



581 
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SEQ 
ID 
NO: 


Predicced 
beginninof 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


«.»*«v^ ^^^^ i;.e^enc containing Signal peptide 

(A^^Alanine, C=Cysteine, D^Aspartic Acid, E= 

Glutamic Acid, P-Phenylalanine. G=Glycine, 

H=Hista.dine, I='Isoleucine, K«Lysine 

L=:Leucine, M=Methionine. N-Asparagine, 

P^Proline, Q==GIutamine, R-Arginine, 

S^Serine, T^Threonine, V:=Valine, 

W=Tryptophan, Y-Tyrosine, X=UnJcnown, *==stop 

Codon, / ^possible nufl i « *. j 

/ / j,jja,t^ ttttcieotxcie deletion, 

\'=po3sible nucleotide insertion) 


6981 






CMTU^WPPLLbbHALHVFKCLFSSRLGTPVSFKLAMDPNC^CEA 

GGSCACAGSCKCKKCKCTSCKKSCCSCCPLGCAKCAQGCICKGA 
5 EiXCS CCA 


6982 


10 


1054 


FUHGFRRA3i.HPAFAARGVFgGGLGQAKQARTRACftALPTPHPS 
APRIiLEPQGVFSLFPPPPGPWPNMILTKAQYDEIAQCLVSVPPT 
RQSLRKLKQRFPSQSQATLLSIFSQEYQKHrKRTHAKHHTSEAI 
ESYYQRYIJTGWKNGAAPVIiDIANEVDYAPSLMARLILERFLQ 
* * « iiJ^i^ ir ay 1 i'UbVIiANQvYQCIVNDCCYGPL 
VDCIKHAIGHEHEVbLRDLLLEKMLSFLDEDQLRAKGYDKTPDF 
ILQVPVAVEGHIIHWIESKASPGDECSHHAYLHDQFWSYWNRFG 
_ JGbVI YWYGPIQELDCNRSRGILLKACFPTNI VrLC!HS 


6983 


153 


1285 


FPQQuu-^Ai:'AAPGlA05EPRKLRAYRRRRQRAKGLKRVAWbAPP 
PSLLQGIX2GWAQAPVDgTLGPEDSRASSPMIQNSRPSLbQPQDV 
n r. r^rrSi irJ, KAfLCGS ISGTCSTLIiFQPLDLLKTRLQTLQ 
PSDHGSRRVGMLAVLLKWRTESLLGLWKGMSPSIVRCVPGVGI 
YFGTLYSLKQYFLRGHPPTALESVMLC3VGSRSVAGVCMSP1TVT 
KTRYESGKYGyESIYAALRSIYHSEGHRGLFSGLTATLLRDAPF 
SGIYLMFYNQTKNIVPHDQVDATLIPXTNFSCGIPAGILASLVT 
W*-**!^ V J. jj^innuij 3£ i'l'KFQWiGQAVTLI PKDYGLRGFPOGGI PRA 
LRRTLMAAMAV7TVYEEMMAKMGLKS v^v:^ J- itm 


6984 


82 


773 


BMSFIiQDPSrFTMGMWSIGAGALGAAALALLlANTDVFLSKPQK" 
AALEYLEDIDLKTLEKEPRTFKAKELWEKNHAVIMAVRRPGCPL 
CREEAADLSSLKSMLDQLGVPLYAWKEHIRTEVKDFOPYFKGR 
X I GPQRRKMMPMGFIRLGVWYNFFRAWNGGFSGNLE 
^^^^^^VVGSGKQGILLBHREKEF^DKVNLLSVLEAAK^ 


69BS 


1845 


1282 


UliKaAYSLPAGSbPRVPATAAAKMASGVQVADKVCRIPYDMKVR 
KCSTPBEIKKRKfCAVIFCLSADKKCriVEEGKEILVGDVGVTIT 
DPFKHFVGMLPEKDCRYALYDAjSFETKESRKEELMFFLWAPEIA 
r-iARoiunj. 3CH£>5KnAIKKKFQGIKHECQANGPEDLNRAClAEKLG 
GSLIVAFEGCPV ^r^^ 


6986 " 


1887 


1324 


«KU^L^lYI^,^i,-PKPGRrkHALUtjVVLbI^TGQIJ^DDFQE3CA^^ 

WQKYAGSRRSMPLGARILFHGVFYAGGFAIVYYLIQKPHSRALY 

YKLAVEQLQSHPEAQEALGPPLNIHYLKLIDRENFVDIVDAKLK 

IPVSGSKSEGLLYVHSSRGGPEXJRWHLDEVFLELKDGQQIPVPK 
LSGEKGDEVKKE 


6987 - 


642 


1350 


YHLYFKMGDPNaKKKQALHrRbRAQLRKKKKSlADQFDPKMY I AP " 

VFKEKKKKSALPEVSEVIPVMTNNYEENILKGVRDSSYSLESSI, 

ELLQKDWQUiAPRYQSMRRDVIGCTQEMDFILWPRNDISKIVC 
LLFSHWKESDBPFRPVOAKFEFHHGDYPTfnRT -mn et> vi^t^^-r,, 

VIWPNQSVPLFIDRQHI^TPKNKATIFKLCSICLYLPQEQLTHM 
AVGTIEDHLRPYMPE v w xih 


656 8 


1623 


341 

( 
< 

] 


liSAAEKASRAyKSSQRQTDiiKNyKXEWWSPQKSQRRyOMYNTAC 
FL6E1EVGLYTIQILQLTPFPHKENELSKKHMVQPLSGKWTIPP 
Ui'KNeCYriALSKFrSHLiO^LQSDLKRCFDFFIDYMVIjLKMRYTO 
KEIABIMLSKKVSRCFRKYTELFCHU)PCLLQSKESQLLQEENC 
RKKLEAbRADRFAGLLEYLNPNYKDATTMESIVNEYAFLLOONS 
KKPMTWEKQNSIIANiri^CLKPWSiCr.IQPI.TTLKKQLREVLQF 
S/GLSHQYPGPYFLACtLFWPENQELDQDSKIiIEKYVSSLNRSFR 
3QYKRMCRSKQASTLFYLGKRKGLNSIVHKAKIEQYPDKAQNTN 
SLWHSGDVWKKNEVKDLLRRLTCQAEGKLISVBYGTEEKIKIPV 
CSVYSGPLRSGRNIERVSPYLGPSIEGPPGL 




3 


f 
E 


'QLLRRPAVPVUSAASGlKfiJGbWSASSGHWCAPAAUKAHAPVPR 
iVRGLGAASTAAPQDAQTGPQPMPRADCIMRHLPYFCRGQWRG 
^GRGSKQLGIPTANFPEQWDm^ADlSTGIYYGWASVGSGDVH 
MWSIGWMPYYKNTKKSMETHIMHTPKEDFYGEILNVAIVGYL 



582 
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SEQ 
ID 
NO: 


ficcii c Ceci 
beginning 
nucleotide 
location 
corr e spending 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino aca.a segment containing signal peptide"" 
(A=Alanine, ^Cysteine, D-^spartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K^Lysine, 
L^Lieucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glut amine, R=Arginine, 
S=Serine, T=Threoaine, VsValine, 
W=Tryptophan, y=Tyroaine, X=.Unknown, *==Stop 
Codon, /^possible nucleotide deletion, 
\«possible nucleotide insertion) 


6989 






KFaKNFDSLKablSAIQGDlEEAKKRLELPEHLKIKEDNFFQVS 
KSKIMNGH 


6990 


2 


1118 


liMPSDRPLsPSTHASAGSHCHAPPTTARRAFPIPF^SKSNMATL 
KDQLIYNLLKEEQTPQNKITWGVGAVGMACAISILMKDLADEL 
ALVDVIEDKLKGEMMDLQHGSLPLRTPKIVSGKDYNVTANSKLV 
IITAGARQQEGESRLKLVQRNVWIFKPIIPI^rWKySPNCKLLIV 
SNPVDILTYVAWKISGFPKNRVIGSGCNLDSARFRYLMGERLGV 
HPLSCHGWVLGEHGDSSVPVWSGMNVAGVSLicrLHPDLGrDKDK 
EQWKEVHKQWESAYEVIKLKGYTSWAIGLSVADLAESIMKNLR 
RVHPVSTMIKGLYGIKDDVFIiSVPCILGQNGISDLVKVTLTQPP 
EARLKKSA0TLWG I QKELQF 


6991 


719 


2S8 


THASGMASWWVLRTRTAVTSLLSPTPATAIAVRYASKKSGGSS 
KJrLGGKSSGRRQGlKKMSGHYVHAGNI TATORHiiT? WHTsr'a myr.T7 

GKNKCLYAI.BEGIVRYTKEVYVPHPRNTEAVDLITRLPKGAVLY 
KTFVHWPAKPBGTFKLVAML 


6992 


169 


451 


KKSSDFHNPUi<bSRPVSLRENljjHQVICSTkMKkUNPKKIAYLL 

SSLLMTNLNPNESTENQPVDAYWAFTLDQEFLTYACVEGTGCLP 
CGRHVH 


6993 


944 


510 


ROAPGCSSLALRQVRQVYCGLVRAPOVOTRPLSSHFTTT^PPraaT v — 
RSPMNQENPPPYPGPGPTAPYPPYPPQPMGPGPMQGPYPPPQGY 

PYQGYPQYGWQGGPQEPPKTTVYWEDQRRDELGPSTCLTACWT 
ALCCCCIiVTOMLT 


6994 


1 


374 


OWCVTCPUmiARQGPAVPPGIQAYgAAPFEDLgVDFTEMSKCRG ' 

DRVWIKNWNVASLCPLWKGPQTWLSPPTAVKVE6IPAWIHHSH 

VKPAARBTWEJmPSPDNPFRVTLKKTTSPAPVTPGS 


^99^- 


346 


1100 


ywPEKDPVMAAaSISSPWGIOiVFKAII^ViOlilLLHSAXAQSR 
RDFAPPGQQKREAPVDVLTQIGRSVRGTLDAWIGPETMHLVSES 
SSQVLWAISSAISVAFFALSGIAAQLLNALGLAGDYLAQGLKLS 
PGQVQTFLLWGAGALWYWLLSLLLGLVLALLGRILWGLKLVTP 

LAGPVALMRSVPDPSTRALLI1LALI.ILYALLSRLTGSRASGAOL 

EAKVRGLERQVEELRWRQRRAAKGARSVEEE 


6996 " 


144 


1346 


GS VAVGI^GlMAAQKDLWDAl VIGAGICKSCFTAYHLAKHRKRIL 
LLEQPPLPHSRGSSHGOSRriRKAYLEDFYTRMMHECYQIWAQL 
EHBAGTQLHRQTGLLLLGMKENQELKTIQANIiSRQRVEHQCLSS 
EELKQRFPNIRLPRGEVGLLDNSGGVIYAYKALRALQDArRQLG 
GIVRDGEKWEXNPGLLVWKTTSRSYQAKSLVITAGPWTNQLL 
RPLGIEMPLQTLRINVCYWREMVPGSYGVSQAFPCPLWLGLCPH 
HIYGLPTGEYPGLMKVSYHHGNHADPEERDCPTARTDIGDVQIL 
SSFVRDHLPDLKPEPAVIESCMYTNTPDEQFILDRHPKYDNIVI 
GAGFSGHGFKIiAPWGKILYELSMKLTPSYDlAPFRTSPPPciTn 


6997 ' 


543 


1942 

. 
] 
I 


gTANAEAAARKSAMDWKBVLRRRLATPNTCPNKKKSEQELKDEE 
MDIiFTKYYSEWKGGRKNTNEFYKTIPRFYYRLPAENEVI>LQKLR 
c c 0 tui V r -i-'WitKb KlLXiliDNEELQNLWPLLDKHQTP PMIGEE AMIN 
YENPLKVGEKAGAKCKQFFTAKVFAKLLHTDSYGRISIMQFPNY 
VWRKVWLHQTRIGLSLYDVAGQGYLRESDLEWYILELIPTLPQL 
DGLEKSFYSFYVCTAVRKFFFFLDPLRTGKIKlQDrrACSPLDD 
EJiELRDEELSKESQETNWFSAPSALRVYGQYriNLDKDHKGMLSK 
EELSRYGTATMTNVFLDRVFQECLTYDGEMDYKTYLDFVLALEN 
?KEPAALQyiFKLLDIENKGYIJJVFSLNYFFRAIQELMKIHGQD 
PVSPQDVKDEIPDMVKPKDPLKISLQDLINSNQGDTVTTILIDL 
flGFWTYENRKALVANPSENSADLDDT 




J70 


1104 ; 

1 


VMBLTIFILRLAIYILTPPLYLl^FLGLWSWICKKWFPyPLVRF 
PVIYNEQMASKKRELFSNLQEFAGPSGKLSLLEVGCGTGANFKF 
fPPGCRVTCIDPHPNFEKPLIKSIAENRHLQFERFWAAGENMH 



583 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
correspondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide " 
{A»Alanine, C-Cysteine, D.*Aspartic Acid, E= 
Glutamic Acid, Phenyl alanine, G^Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
LsLeucine, M=Methionine, N==Asparagine , 
r-ii-i^uiiue, u-Cyiutamine, R^Argmine, 
s=Serine, T^Threonine , V=Valine, 
W=Tryptophan, Y^^Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=poseible nucleotide insertion) 


6998 






QVADGSVDVWCTLVLCSVKNQERIjLREVCRVLRPGGAFYFMEH 
VAAECSTWNYFWQQVLDPAWHLLFDGCNLTRESWKABERASFSK 
IiKLQHIQAPLSWELVRPHI YG YAVK 




2 


616 


FVSRALLRVRSRRHPAEERAAPGRPEDAPIECPGATNCPEPLWC 
SHLPVPYAPPTMESRGKSASSPKPDTKVPQVTTEAKVPPAADQK 
APLTKPSKKEAPAEKQQPPAAPTTAPAKKTSAKADPALLNNHSN 
LKPAPTVPSSPDATPEPKGPGDGAEEDEAASGGPGGRGPWSCEN 
r IN r JJ Jj V AWjr VA V AAx AIjI LGVAFLVRKK 


6999 


14 


1591 


GRAGACSRRDTAMSIEIESSDVIRLIMQYLKENSLHRALATLQE 
ETTVSLNTVDSIESPVADINSGHWDTVLQAIQSLKLPDKTLIDL 
YEQWLELIELREI^GAARSJLLRQTDPMIMLKQTQPERYIHLENL 
LARSYPDPREAYPDGSSKEKRRAAIAQALAGBVSWPPSRLMAL 
LGQALKWQQHQGLLPPGMTIDLFRGKAAVKDVEEEKFPTQLSRH 
IKFXBQKSHVECARFSPDGQYLVTGSVDGPIEVWKFTTGKrRKDL 
KYQAQDNFMMMDDAVIjCMCFSRDTEMLiATGAQDGKIKVWKIQSG 
QCLRRPERAHSKGVTCLSFSKDSSQILSASFDQTIRIHGLKSGK 
TLKEFRGHSSFVNEATFTQDGHYIISASSDGTVKIWNMKTTECS 
NTFKSLGSTAGTDITVWSVILLPKNPEHPWCWRSMTVVIMNMQ 
GQIVRSFSSGKREGGDFVCCALSPRGEWIYCVGEDFVLYCFSTV 
TGKLERTLTVHEKDVIGIAHHPHQNLIATYSEDGLLKLWKP 


7000 
7001 


2 


827 


GPGWFLBliMBSEGPPESERSEPFSQREEENEEEEAQEPEErGP 
KNPIjLQPALTGDVEGLQKIFEDPENPHHEQAMQLLLEEDIVGRN 
LLYAACMAGQSDVIRALAKYGVNLNEKTTRGYTLLHCAAAWGRL 
ETLKALVELDVDIEALNPREERARDVAARYSQTECVEFraDWADA 
RLTLKKYIAKVSIAVTDTEKGSGKLLKEDKNTILSACRAKNEWL 

ETHTEASINELFEQRQQLEDIVTPIFTKMTTPCQVKSAKSVTSH 
DQKRSQDDTSN 




2056 


844 


RRCIillAPLKGCFIPIYFlFrPETEPLSCCPGWSAVAQSRLiAN 
FASQVQAI PILPKDSQVGPD VKSEAAPKRALYES VFGSGE I CGP 
TSPKRLCIRPSEPVDAVWVSVKHDPLPLLPEAWGHRSTJNSPTI 
VSPAIVSPTQDSRPWMSRPLITRSPASPLNNQGIPTPAQLTKSN 
APVHIDVGGHMYTSSLATLTKYPESRIGRLPDGTBPIVLDSLKQ 
HYFIDRDGQMFRYILNFLRTSKLLIPDDFKDYTLLYEEAKYFQL 
QPMLLEMERWKQDRETGRFSRPCECLVVRVAPDLGER1TI,SGDK 
SLIEEVFPEIQDVMGNSVNAGWNHDSTHVIRFPLKTGYCHLNSVQ 
V iuiiLEUJvuKuc vijis^^w^t^VDSSQFSEYvTjRRBLRRTPRVPS VI 
RIKQEPLD 


7002 
7003 


1043 


498 


PMPSSTRWTrs*TYTI>rSSAWACRPTTGTCr*TAAPGPTVRWWP 

TPCSRHQSRRRLTCWCSTSRPCGR*GGLCVRTAPTRPTTSASSS 

SPTTSAGTSWPAGRRTGTATSGTATTTSVWPGCGTRMWSTQWSSV 

PRSRSCCSRPATTPPSKPGAPHAPCASSRHLAKQLAPSSPGLPA 
RGAEVC 


7004 


8ie 


61 


QGRFRAFCWQRDFLQPPGMRLSALLALASKVTLPPHYRYGMSPP 
GSVADKRKNPPWIRRRPVWEPISDEDWYLFCGDTVEILEGKDA 
GKQGKWQVI RQRNWWVGGLNTHYR YIGKTMD YRGTMI PSEAP 
LLHRQVKLVDPMDRKPTEIEWRPTEAGERVRVSTRSGRIIPKPE 
FPRADGIVPETWIDGPKDTSVEDALERTYVPCLKTLQEEVMEAM 
GIKETR\NTRRSIGIEPGAEQLLPNPCPSLEG 




121 


2285 

1 


FIiLPVLTSRSLRQPAVPHARLGGVEPAAMKSARAKTPRKPTVkk 
G\PKRTLKTQLG/YYCRVRPLGFPDQECCISVINNTTVQLHTPE 
GYRLNRNGDYKETQ YS FKQVFGTHTTQKELFDWANPLVNDL IH 
SKNGLLPTYGVTGSGKTHTMTGSPGEGGLLPRCLDMIPWSIGSP 
BAKRYVFKSNDRNSMDIQCBVDALLERQKREAMPNPKTSSSKRQ 
TOPEPADMITVQBFCKABEVDEDSVYGVFVSYIEIYNNYIYDLL 
BEVPFDPINPNLHNLITCPVKI KNHNMYVAGCTEVEVKSTEEAFE 
/PWRGQKKRRI ANTHLNRBSSRSHSVFNI KLVQAPLDADQDNVL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide"" 
tA=Alanine, C«Cy3teine, D^^Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=i,ysine, 
L^Leucine, M=Methionine, N=rAsparagine , 
P=Proline, Q^Glutamine, R=Arginine, 
S=Serine, T=Threonine, v=.Valine, 
W^Tryptophan, Y=TyroBine, X«Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


7005 






QEKEQITISQLSLVDLAGSERTNRTRAEGNRLREAGNINQSLMT 
LRTCMDVLRENQMYGTNKiyiVPyRDSKLTHLFKNYFDGEGKVRMI 
VCVNPKAJEDYEENLQVMRFAEVTOEVEVMPVDKAICGLTPGRR 
YRNQPRGP\lGNEPLVTDWLOSPPPT.PRryTT nriOTiTPnTT nisT 

lEALEKRHNLRQMMIDEFNKQSNAFKALLQEFDNAVLSKENHMQ 
GKLNBKEKMISGQKLEIERLEKKNKTrjEYKIEIIiEKTTTIYEED 
KRNLQQELETQNQKLQRQFSDKRRIjEARLQGMVTETTMKWEKEC 
ERRVAAKQLEMQNKLWVKDEKLKQLKAIVTEPKTEKPERPSRER 
DREKVTQRSVSPS P VPVSYL 


7006 


63 


876 


RNWALYQRWRCLRLQGI^ACRLHTAWSTPPRWIJ^^RLGLPEEL 

V ^\JXJa«^l•lrtyJ^JbyKl iKlfeXiPGGQKIDAVAWNTTPYQIARQ 

ISSTLADTAVAAQVWGEPYDLERPLETDSDLRFLTFDSPEGKAV 

FWHSSTHVLGAAAEQFLGAVLCRGPSTEYGFYHDFFLQKERTIR 

GSELPVXJERICQELTAAARPFRRLEASRDQLRQLFKDNPPKLHL 

lEEKVTGPTATVYGCGTLVDLCQGPHIiRHTGQIGGLICLLSNSSS 
LWRSSG 


7007 


22 


898 


JMAbX:}KHSTAVKMAAaAWX^VLPVILIXLQAj^^SPi:;S PFSAGPAt" 

VAAADRSKWHIPIPSGKNYFSFGKILFRNTTIFLKFDGEPCDLS 

iMtj^xn xiift-bAUt, YWJB I ynFKAEEVEIjYLEKLKEKRGLSGKYQTS 

SKLPQNCSBLFKTQTFSGDFMHRLPIiIiQEKQEAKENGTNIjTFIG 

DKTAMHEPLQTWQDAPYIFIVHlGISSSKESSKENSLSNIiFTMT 

VEVKGPYEYLTLEDYPLMIFFMVMCIVYVLFGVLWLAWSACYWR 

DLLRIQFWIGAVIFLGMLEKAVFYAGFQ 




2 


1001 


AMTVSGPGl-^'EPRPATPGASaVEQLRKEGyBLFKCXSDYG^AlAA ' ' 

YTQALGLDATPQDQAVLHRNRAACHLKIiEDYDKAETEASKAIEK 

DGGDVKALYRRSQALEKLGRLDQAVLDI^RCVSLEPKNKVPQEA 

brniggqiqekvrymsstdakveqmfqilldpeekqtekkqkas 

QNLWLAREDAGAEKI FRSNGVQLLQRLLDMGETDLMLAALRTL 
VGICSEHQSRTVATLSIUSTRRWSILGVESQAVStiAACHLLQV 

PGQFFGDQTCSCRLPG 1 RPGDI 1 hh 


7008 


70 


1478 


CRSALGHERPPPAHLPAGGRRLQTCPRSCRWLGRPPSGLPPGPR 
SPPPIAGPGQJtMVQKKPAELQGFHRSFKGQNPFEIiAPSLDQPDH 
GDSDFGLQCSARPDMPASQPIDIPDAKKRGKKKKRGRATDSFSG 

IRSRVFREVEiyiLYQCQGHRNVLELIEFPEEEDRFYI,VFEKMRGG 
SriiSHIHKRRHFNELEASWVQDVASALDFLHNKGIAHRDLKPE 
NILCEHPNQVSPVKICDPDLGSGIKbNGDCSPISTPELLTPCGS 
AEYMAPEWEAFSEEASiynKRC^DLWQLfiVTr.VTT t o/^vtsbim//^ 

RCGSDCX5WDRGEACPACQNMLFESIQEGKYEFPDKDWAHISCAA 
KDLISKLLVRDAKQRLSAAQVLQHPWVQGCAPEWTLPTPMVLQR 
WDSHFLLPPHPCRIHVRPGGtiVRTVTVNE 


• 7009 


1 


626 


ARQLRNSWVDDFVAAPLIPLSQQI PTGNSLYESYYKQVDPAYTG " 
RVGASEAALFLKKSGLSDIILGKIWDLADPEGKGFLDKQGFYVA 
LRLVACAQSGHEVTLSNLNLSMP PPKFHDTS S PLMVTPPSAEAH 
WAVRVEEKAKPDGlFESLLPlNGLLSGDKVKPVrj^4NSKLPLDVL 
GRVWDIiSDIDKDGHLVRDEFAVAMHLVYRALE 


7010 


79 


571 


SHTRRAWPETLLSPLCPLLGGGTAMSGGEQKPERYYVGVDVGT 
GS VRAAIiVXiQSGVLLAFADQP IKNWBPQFNHHEQSSEDI WAACC 
WTKKWQGIDLNQIRGIiGFDATCSLWLDKQFHPLPVMQEGDS 
HRNVIMWLDHRAVSQVNRINETKHSVLQYVGG 


7011 


3 


994 

] 


RIQTLPNQNQSQTQPLLKTPPAVLQPIAPQTTFGVQTQPQPQSL 
LOAQISAASITPLLQTQPQPLLQQPQQKAGLLQPPVRIVSQPQP 
?UtRLDPPSRPSGRNDRGDQVPNRKDDRSRERERERRRSRERSPQ 
RKRSRERSPRRERERSPRRVRRWPRVTVQPSKFSLDCPSCDMM 
ELRRRYQNLYIPSDFFDAQFTWVnAFPLSRPFQLGNYCNFYVMH 
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SEQ 
ID 
NO: 


Predicted 
beginriing 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sec[uence 


7, l-T . '•^y'^nc concaxning signal peptide 
(A=Alani.ne, C=Cysteine. D=Aspartic AcidT E= 
Glutamic Acid, Phenyl alanine, G=Glycine 
H=Histidine, I^lsoleucine, K=Lysine 
L=Leucine, M=Methionlne, N-Asparagine 
P=Proline, Q-Glutamine, R-Arginine, ' 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, X=Unknown, *«stop 
Codon, /^possible nucleotide deletion 
\«possible nucleotide insertion) 


7012 






KEVESLEKNMKlLUPPUAUHi.YSAKVMLMASPijMEDLYHKSCAL-- 

AEDPQELRDGFQHPARLVKFLVGMKGKDEAMAIGGHWSPSLDGP 

DPEKDPSVLIKT\AIRCCKALTG 


7013 


1 


2661 


AAAAAAAATGTEAGPGTAGGSENGSSVAAQPAGLSGPAEVGPGA 

VGERTPRKKEPPRASPPGQLAEPPGSAGPQAGPTWPGSATPME 

TGIASTPEG\RRTSRRICRAKVEyREMDESLANLSEDEYrSEEER 

NAKAEKEKKLPPPPPQAPPEEENESEPEBPSGVEGAAFQSRLPH 

DRMTSQEAACFPDIISGPQQTQKVFLPIRNRTLQLWLDWPKIOL 

TFEATLQQLEAPYNSDTVLVHRVHSYLERHGLINEVSIYKRIKPL 

PTKKTOKVIirGSGVSGLAAARQl.QSFGMDVTLLEARDRVGGRV 

ri^jwjiN X v-'^WiAMVVTGIiGGKPMAVVSKQVNMELAKIKQKCP 

LYEANGOAVPKEKDEMVEQEPNRLLEATSYLSHQLDFNVLNNKP 

VSLGQALEWrQLQBKHVKDEQIFHWKKlVKTQEELKEtiLNKMV 

NLKEKIKBLHQQYKSASEVKPPRDITAEPLVKSKHRDLTALOCB 

YDELAETQGKLEEKLQELEANPPSDVYLSSRDRQILDWHFANLE 

FANATPLSTLSLKHWDQDDDFEFTGSHLTVRNGYSCVPVALAEG 

IJ>Iia,NTAVRQVRYTASGCBVIAVWTOSTSQTPrYKCDAVLCTL 

PLG^^KQQPPAVQFVPPLPE:WKTSAVQIWGFG1^LNKVVLCFDRV 

FWDPSVNLFGHVGSTTASRGELFLFWNLYKAPILLALVAGEAAG 

IMEHISDDVIVGRCLAILKGIFGSSAVPQPKETWSRWRADPWA 

RGSYSYVAAGSSGWDYDLMAQPITPGPSIPGAPQPIPRLFFAGE 

HTl RN YPATVHGALLSGLREAGRIADQFLGAM YTLPRQATPGVP 
AQQSPSM 


7014 


1 


2661 


KKAGSVKRGEARLFGPTERQSEKPLRPSAARRPEMLSGKKAAAA- 

AAAAiJUU^TGTEauSPGTAGGSENGSEVAAQPA^ 

VGERTPRKKEPPRASPPGGLAEPPGSAGPQAGPTWPGSATPME 

TGIAETPEG\RRTSRRKRAKVEYREMDESLAHLSEDEyySEEER 

NAKABKBKKLPPPPPQAPPEEBNBSEPEEPSGVEGAAFQSRLPH 

DRMTSQEAACPPDIISGPQQTQKVFLFIRNRTLQLWLDNPKIQL 

TFEATLQQLEAPYNSDTVLVHRVHSYLERHGLINFGIYKRIKPL 

PTKKTGKVI I IGSGVSGIAAARQLQSFGMDVTLLEARDR VGGRV 

/4 A c Ki«slM Y VADlrfSAMVVTGLGGNPMAWSKQVNMELAKI KQKCP 

LYEANGQAVPKEKDEMVEQEFNRLLEATSYLSHQLDFimjiJNKP 

VSLGQALEWIQLQEKHVKDEQIEHWKKIVKTQEELKELLNKMV 

NLKEKIKELHQQYKEASEVKPPRDITAEFLVKSKHRDLTALCKE 

^^f.^^^^^^^^°^^^P^SDVYLSSRDRQILDWHFANLE 

PANATPLSTLSLKHWDQDDDFEFTGSHLTVRNGYSCVPVALAEG 

I*DIKLNTAVRQVRYrASGCEVIAVKTRSTSQTFIYKaa!AVLCTL 

PLGVLKQQPPAVQFVPPLPEWKTSAVQRMGPGNLNKWLCPDRV 

FWDPSVNLFGHVGSTTASRGELFLPWWLYKAPILLALVAGEAAG 

IMEWISDDVIVGRCLAILKGIFGSSAVPQPKETWSRWRADPWA 

RGSYSYVAAGSSGNDYDLMAQPITPGPSIPGAPQPIPRLFFAGE 

HT1RNYPATVHGALLSGLREAGRIADQFIK3AM^ 

AQQSPSM 


-J 


3 


3950 

I 

i 
I 

r 

E 
£ 


L>ravGDKIRILATLEDGWLEGSLKGRTGlFPYRFVKl,CPDTRVE 
ETMALPQEGSLARIPETSLDCLENTJbGVEEQRHETSDHEAEEPD 
ZIISEAPTSPLGHLTSEYDTDRNSYQDEDTAGGPPRSPGVEWEM 
?LATDSPTSDPTEWNGlS5QPQVPFHPNLQKSQYySTVGGSHP 
ISEQYPDLl.PLEARrRDYASLPPKRMySQLKTLQKPVLPLYRGS 
5VSASRWKPRQSSPQLHNIASYTKKHHTSSVYSISBRLEMKPG 
WQGLVMEAATHSQGDGSTDLDSKLTQQLIEFEKSLAGPGTEP 
)KILRHFSIMDPNSBKDIVRGSSKLITEQELPERRKALRPPPPR 
>CTPVSTSPHLLVDQMLKPAPPLWRPSRPAPLPPSAQQRTNAV 
PKLLSRHRPTCETLEKEGPGHMGRSLDQTSPCPLVLVRIEEMB 
DLDMYSRAQEELNLMLEEKQDESSRAETLEDLKFCESNIESLW ' 
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ID 

NO: 


Predicted 
beginning 
nucleotide 
location 
corre sponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A-Alanine, C=Cysteine, D^Aspartic Acid, E=. 
Glutamic Acid, P» Phenyl alanine, G=Glycine, 
H=Histidine, I=lsoleucine, K=Lysine, 
L^Leucine, M=Me thionine, Ns=Asparagine, 

Proline, Q-Glutaraine, R=Arginine, 
S=Serine, T=Threonine, v=:Valine, 
W=Tryptophan, Y^Tyrosine, X-Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=poseible nucleotide insertion) 








MELQQLREMTLLSSQSSSLVAPSGSVSAEWPSQRMLEKRAKVIE " 

ELLQTERDYIRDLEMCIERIMVPMQQAQVPNIDFEGLFGNMQMV 

IKVSKQLIiAALEISDAVGPVFLGHRDELEGTYKIYCQNHDEAIA 

LLEIYEKDEKIQKHLQDSLADLKSLyNEWGCTNYINLGSFr.IKP 

VQRVMRYPLLLMELLNSTPESHPDKVPLTNAVIAVKElNVNrNE 

YKRRKDLVLKYRKGDEDSLMEKISKLNIHSIIKKSNRVSSHIiKH 

LTGPAPQIKDEVFEETEKNFRMQERLIKSPIRDLSLYLQHIRES 

ACVKWAAVSMWDVCMERGHRDLBQFERVHRYISDQLFTNFKER 

TERLVISPLNQLLSMFTGPHKLVQKRFDKLLDFYNCTERAEKLK 

DKKTLEELQSARNNYBALNAQIjLDELPKFHQYAQGLFTNCVHGY 

AEAHCDFVHQALEQLKPLLSLLKVAGREGNLIAIEHEEHSRVLQ 

qlqvftffpeslpatkkpferktidrqsarkpllglpsymlqse 
elrasliiaryppekiipqaernfnaaqdldvsllegdlvgvikkk 

DPMGSQNRWLIDNGVTKGFVYSSPIiKPYNPRRSHSDASVGSHSS 
TESEHGSSSPRPPRQNSGSTLTFNPN\S\MAVSFTSGSCQKOPQ 

daspppkewdqqtlsaslnpsnsesspsrcpsdpdstsqprsgd 
sadvardvkqptatprsyrnfrhpeivgysvpgrngqsqdlvkg 
cahtaqapedrstepdgseaegnqvyfavytfkarnpnelsvsa 
nqklkxlepkdvtgntewwuljevngkkgyvpsnyrrkteyt 


7015 


1842 


513 ■ - 


rqawhewaapswrgarlvqsvlrvwqvgphvarervipfssll 
gfqrrcvscvagsapsgprlasasrsngqgsaldhplgpsqpds 

SVTPCVPAVSMflRDEQDVLLVHHPDMPENSRVLRWLLGAPNAG 
KSTLSNQriLGRKVFPVSRKVHTTRCQAliGVITEKETQVILLDTP 

giispgkqkrhhlslslledpwksmesadlvwlvdvsdkwtrn 

QLSPQLlJiCXTKYSQIPSVLVMNKVDCLKQKSVLLEbTAALTEG 

wngkkiikmrqafhshpgthcpspavkdpntqsvgnpqrigwph 
fkeipmlsalsqedvktlkqylltqaqpgpweyhsavltsqtpe 

EICflNIIRBKLLEHLPOEVPYlTVQQKTAVWEEGPGGELVIQQKl, 

lvpkesyvklligpkghvisqiaqeaghdi^diflcdvdirlsv 

KLhK 


7016 


167 


2513 


ILNAPKPPPPRDSVSAVAAKRDTGGGSWGTGMDVSGQETDWRSr* 

AFRQKLVSQIEDAMRKAGVAHSKSSKDMESHVFLKAKTRDEYLS 

IiVARLlIHFRDIHNKKSQASVSDPMNALQSLTGGPAAGAAGIGM 

PPRGPGQSLGGMGSLGAMGQPMSLSGQPPPGTSGMAPHSMAWS 

TATPQTQLQIiQQVAAAAAAATARSSSSSSRRRYSSSSSSSKSKQ 

fqaqqsamqqXqfqaVwqqqqqlXqqqqqqqqhliklhhqk-qq 

QrOQQQQQLQRIAQLQIiQQQQQQQQQQQQQQQQALQAQPPIQQP 
PMQQPQPPPSC3ALPQQLQQMHHTQHHQPPPQPQQPPVAQNQPSQ 
LPPQSQTQPLVSQAQALPGQMLYrQPPLKFVRAPiyrVVQQPPVQP 
QVQQQQTAVQTAQAAQMVAPGVQVSQSSIiPMLSSPSPGQQVQTP 
QSMPPPPQPSPQPGQPSSQPNSJHVSSGPAPSPSSFLPSPSPQPF 

Vqspvtartpqnfsvpspgplntpvnpssvmspagssqaebqqy 

LDKLKQLSKYlEPLRRMINKIDKNEDRKKDLSKMKSLIiDILTDP 

skrcplktlqkceialeklkndmavptpppppvpptkqqylcqp 
lldavlanirspvfnhslyrtfvpamtaihgppitapwctrkr 
RLEDDERQS I PS vlqgevarldpkflvnldpshcsnngtvhli c 
klddkdlpsvpplelsvpadypaqsplwidrqwqydanpflqsv 
hrcmtsrllqlpdkhsvtallntwaqsvhqaclsaa 




7017 


1 


1785 

i 


IWLGNTCYMHSVI *ALPMATDFRRQVI.SLNLNGCNSLMKKLQHL 

paplahtqreayapr I fpeasrppwftprsoodcseylrflldr 

IiHEEEKILKVQASHKPSEILECSETSLQEVASKAAVLTETPRTS 
[XSEKTLIEKMFGGKLRTHXRCLNCRSTSQKAEAF'IDLSIiAFWPS 
YSLEYMSCPDCSQSPSIQDGGLMQASVPGPSEEFWYWPTTAAP 
I cdslvnektigsppnefycsekts VPNESNKILVWKDVPQKPG 

3ETTPSVTDLI,NYFIiAPE ILTGDNQYYCSNCASLQNAEKTMQIT 
SBPBYhZhThLRFSYDQKYHVRRKILDNVShPLVLELPVKRlTS | 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue o£ 
amino acid 
sequence 


Predicted end" 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal pepETS^ 
{A=Alanine, C=Cysteine, D«Aspartic Acid, 
Glutamic Acid, P=Phenylalanine, G=Glycine ■ 

H=Histidine, I-Isoleur-i no v t,,^j-, 
* " i £>*jieucine , it^liysine 

L=Leucine, M=Methionine, N=:Asparagine 

P=Proline, Q=Glutamine, R=Arginine, 

S=Serine, T=Threonine, V=Valine, 

W=Tryptophan, Y==Tyroeine, X=Unknown, *=Stop 

Codon^ /=possible nucleotide* et^i^*-^^-^ 

_ X^possible nucleotide insertion) 


7018 






t SSL.SES VDh'TDLS ENLAKKLKPSGTDEASCTKLVPYLLS~ 

SWVHSGISSESGHyYSYARNITSTDSSYQMYHQSEALALASSQ 

SHLLGRDSPSAVFEQDLENKEMSKEWPLFKDSRVTFTSPOSVOK 

ITSRPPKDTAYVLLYKKQHSTNGLSGNNPTSGLWINGDPPLQKE 
LMDAITKDNKLYLQEQELNARARALOAAqA«5rcc'OT5wric.r%r,xTTs,, 

PGSC6PTGGGGGG3FNTVGRLVP 


7019 


484 


1066 


SI.VFKGMTWSGh:AGHHCSALFNLAAYHQLFVGTERIRAPEIIFQ 

PSLIGEEQAGIAETLQYILDRYPKDVQEMLVQNVFLTGGNTMYP 

GMKARMEKELLEMRPFRSSFQVQLASNPVLDAWYGARDWALNHL 

DDNEVWITRKEYEEKGGEYJ^KEHCASNIYVPIRLPKQASRSSDA 
QASSKGSAAGGGGAGEQA 


7020 


1048 


335 


APGGFLVTMVFPAPSPPWMLUCCSHEVTAGPPTLUKUMSALVAA 

^^?n^:Ji^?^^"^^^''^^^^°^T^^'^^THHDRKNGRS 

SSGALRGVCSCVEAGKACDPAARQFNXLIPWCLPHTGNRHNHWA 

GLYGRLEWDGFFSTTVTNPEPMGKQGRVLHPBQHRWSVRECAR 

SQGFPDTYRLFGNILDKHRQVGNAVPPPLAKAIGLEIKLCMLAK 
ARESASAKIKEEEAAKD — ^i^i^^si^t-MlAK 


7021 


1 


2154 


l^ADSKRKSVLLDKIKNLQVALTSKQQSLETAMSFVARKTFKRVR 
NGFLMRKVAVPPSNTPTRASPQLREAVLKLSDAGITPLFLTRQE 
DRQLINALQ INNTAVGHALVUPAGRDLTDFLENVLTCHVCLDI C 
NIDPSCGFGSWRPSFRDRRAAGSDVDIDMAFILDSAETTTLFQF 
wSHKKyiAYLVRQLDMS PDPKflSQHFARVAWQHAPS ES VDNAS 
MPPVKVEFSLTDYGSKEKLVDPLSRGMTQLQGTRALGSAIEYTI 

s if V vxxsiGRKVUrKBVYTFASBPNDVFFKLVDKSTELNEEPLMR 

PGRLLPSFVSSENAFYLSPDIRKQCDWFQGDQPTKNI,VKPGHKO 

WVPNNVTSSPTSNPVTTTKPVTTTKPVTTTTKPVTTTTKPV'TT 

INQPSVKPAAAKPAPAKPVAAKPVATKTATVRPPVAVKPATAAK 

PVAAKPAAVRPPAAAAAKPVATKPEVPRPOAAKPAATKPATTKP 

MVKMSREVQVPEITENSAKLHWERPEPPGPYPYDLTVTSAHDQS 

L^^KQNLTVTDRVIGGLLAGQTYHVAWCYLRSQVRATYHGSFS 
TKKSQPPPPOPARSASSSTTKTT MVQT'wnT 7.r rn™mr>-r^Y>* 

TCRDFILKWYYDPNTKSCARFWYGGCGGNENKPGSQKECEKVCA 
PVLAKPGVISVMGT w^^n-Avuft 




7022 


2 


338 


VNAVSFFPNG7AFAlXjaDDATCRLFD£iRADQELU.YSHDNIICG ' 
ITSVAFSKSGRLLLAGYDDFNCNVWDTLKGDRAGVIAGHDNRVS 
CLGVTDDGMAVATGSWDSFLRIWN v-vi^^«uNRVS 




7023 


2 


856 


VYIGSFWSHPLLIPDWRKLFEAEEQDT.RpnTnQT ddxt^^t rtwr^T — 

DLIKRARIAKVHAYIISSLKKEMPSVPGKDNKKKELVNNLAEIY 
GRIEREHQISPGDFPNLKRMQDQLQAQDFSKFQPLKSKLLEWD 
DMIAHDIAQIJWLVRQEESQRPIQMVKGGAFEGTI.HGPFGHGYG 
HX3AGEGIDI}AEWVVARDKPMYDEIFYTr,SPVDGKiaX3ANAKKEM 

VRSKLPNSVIKSKIWKLADIDKDGMLDDDEFAIANHLIKVKLEGH 
ELPNELPAHLLPPSKRKVAE 




7024" 


2 


748 

] 
] 


w V V F i V pg i: XKKTQNADGFSTYVCLVLL VANIIjR I LF 
WPGRRFESPLLWQSAIMILTMLLMLKLCTEVRVANBLNARRRSF 
rAADSKDEEVKVAPRRSFLDPDPHHFWQWSSFSDYVQCVLAPTG 
7AGY1 rYI>SIDSALPVBTLGFLAVLTEflMLGVPQLyRNHRHQST 
SGMSlKMVI^WTSGDAFKTAYFLLKGAPIiQFSVCGLriQVLVDLA 
CLGQAYAFARHPQKPAPHAVHPTGTKAL 






1207 


190 ■ ■ f 
£ 
C 

i; 

F 
V 


aiUVTGyVAQVWMFGGGGVIiSSGEQIiQMPVRPKi^GLGPSPGWLV 
ISRRGSPGTVLGLPFWLLTPVLVSRSIRSMLLLTRSPTAWHRLS 
JLKPPVLPGTLGGQALHLRSWLLSRQGPAETGGQGQPQGPGLRT 
iLITGLPGAGLGGAWLALRAEKERLQQQKRTEALRQAAVGQQD 
■HLLDHRGRARCKADFRGQWVLMYFGFTHCPDICPDELEKLVQV 
RQLEAEPGLPPVQPVFITVDPERDDVEAMARYVQDFHPRLLGL 
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SEQ 
ID 
NO: 


Freaicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end" 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal peptlHi- 
(A:=Alanine, C=.Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G^Glycine 
H=.Histidine, I=Isoleucine, K=Lysine, 
L^rLeucine, M=Methionine, N=Asparagine, 
PtProline, Q=Glutamine, R=Arginine, 
S5;Serine, T=»Threonine , V«Valine, 
W«Tryptophan, Y=Tyrosine, X=Unknown, *«stop 
Codon, /^possible nucleotide deletion, 
\*=pos sable nucleotide insertion) 


/025 






TOSTKQVAQA5HSYRVYYNflGPKDEDQDYIVUHSlAlYLLNPDG 
LFTDYYQRSRSAEQISDSVRRHMAAFRSVLS 


7026 " 


232 


832 


KQVIRTCEKRPTFNQHTVFNLHQRLNTGDKLNEFKELGKAPISG 
SDHTQHQLIHTSEKFCGDKECGNTFLPDSEVIQYQTVHTVKKTY 

ECKHCGKSFSLRSSLTGHKRIHTGEKPFKCKDCGKAFRPHSOLS 
VHKRIHTGEKSYECKECGKAFSCG 


7027 


328 


1146 


w*'XSifSit5DIKDIKKAAKSMLDPAHKSHFHPVTPSLVFLCPIPDG 
v\j vi^ivi\ot^i V •'^wajxtc,KUIFYASRFKDMPNFIAI>EK 
SSVLRHCCDLLIGVAAGSSDKICTSSLQVQRRPKAMMASIGRLS 
HGESADLLISCMAESAIGWISSRPWVGELMFTFLPGDFESPLHK 
LRKSS*LPRKHR*QPINAVRMF3uDQCMDGSIALRAIVSEIPVFB 
,*wj.\»cij.r vwuuiijir'i'iiywGAVTTNVPKLSNSGKLLG 
QDEQPHIFG 


7028 


43 


954 


ORiU.QDQQRi'j!;miiUGAEGGGKRGEAGWiiGUS;i?EIVKBNKLPEH " 

YYQELKIVPEGEWGQFMDAIiREPLPATLRITGYKSHAKEILHCL 

KNKYFKELEDLEPTOGQKVEVPQPI^WyPEEIAWHTNLSRXILRK 

SPHLEKFHQFLVSETESGNISRQEAVSMIPPLLLNVRPHHKILD 

MCAAPGSKTTQLIEMLHADMNVPFPEGFVJANDVDNKRCYI^VH 

QAKRLSSPCIMWNHDASSIPRLQIDVDGRKEILFYDRILCDVP 

CSGDGTMRKNlDVWKKWTTLNSLQLHGIiQLRIATRGAEQL 


7029 


18d 


608 


OKJr'yyiijyEPGTMVEKGSDSSSEKGGVPGTPSTQSLGSHNyXRNS 

KKMQSWYSMLSPTYKQRNEDFRKLFSKIiPEAERLIVDYSCALQR 
ElIjLQGRLYLSENWiCFYSMrPPMPTTTQTnT vmrn^ri-r vit-ct™* 


7030 


1343 


40 


VI^SNTEAKQATGTSSKLRHGTGQEKGRBGPRCPSQIAQLRLWG 

/PCPHAGRETGPRASAPIPGS*GHGWHW*RKDGRGERSEGPSAr. 

SPHSPSLLNMQQAPTHVGPGMGSQRPRSSWPEQVGVGSQLSRE 
RWRA* RS LPGAAAS ERTEMTKERS P / R prnr vn c omui tpoi^ r^i^rr 

TRKRNSRRNTMVSR6GGCIiLyPLQSIMPE*QLR*GAHASPPTQG 
R*GKGGPRSPLTKASGTTHIPTPPPGSIP/RPTRDSGPGTDrTS\ 
AAPGQKRGHREA*QGPEPV/WGRVTTHLQGPAG*TKPLGS\RNW 
VPGPAEGEQGEGAGi:,EGRP*PLKGCRSTLTFSPQLSlPMVGKKP 
PEGTTASFFP\RSCHSE*RKPPPSCPHAPALSLPHPLPLPLPPL 
PLPLPGAGT*HSARSGRPGQSETGSLCHNCHHCPPHCPKCSPGG 


7031 


2 


521 


FVCFSAPGSGUGGKRRVKMELSAVGERVFAflEALLKRRIRKGRM 
EYLVKWKGWSQKYSrWEPEENIbDARLLAAFEEREREMEI.YGPK 
KRGPKPKTFLrJCAOAKAKAKTYEPRSDSARGIRIPYPGRSPQDL 
ASTSRAREGLRN\RVCPRQRAAPAPAAP\PRRGPSGPGPRPG*G 
PGLHPPGPGGPSKHGFVPASEQHQHQOHLPRRGPSGPGPRPr, 


7032 ■" 


960 


59 


il^JbVPGAEWPRKPPAQICPQLTSRPHI^SSPRSLSPGCGHSPGPG 
/CKPS/RHCDELHEGPSRTAAIiPCGKPQPKHGVEECG/PCPCIA 
PRRLTEPPALTVSPVGRAAPSGAL*PSGRACSACSHRIiAPEAAL 
SAAAPRPSIiGSGQNASGLPAASLPPQDSSQPHKTVPSPARSVPP 
LGAQARAAPPRLWCPRALVSG* EASPEAVSVAAGPPVPGPTPST 
SGSTASHSRRGC*SPR*TPAPPRRDHGRSAAPEVLTAAASAQPC 
?^QGGPRPTGAGRTPSPLGLPPSRGPPAASARPFCRHPSI, 


7033 


1393 
683 


2104 

] 
I 

815 I 


KRPGRTEPVEPPPVPPPPRASNSKSRCR*KNLHIAPi,*QSPLRK 
SRQIGTSSLPFGRSAGERPRPAATFCl.SRGGSSPVFL*PSSSSL 
^PmKRQFGRLHShFWKSmKmSFLLTPKLDTSUnSGmYRQR 
[»PRLHTFLKKSLQMASELAPPLPTPAPLASSLPPPPGPPPLLPV 
5LA*riSRSGILVPPNSGFSLSC\PLGDH*GgSGEVRGSCGSPPP 
niCWVI,PPPP*Ij:,LPPR 

iSRDCLSSSAtS«RARRSKCSGPKRATPLDSGP6P*APPGPSSA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 

nucleotide 

location 

to first 
araino acid 
residue of 
araino acid 
sequence 


Amino acid aegment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E== 
Glutamic Acid, F=Phenyl alanine, G=Glvcine 
H=Histidine, I=Isoleucine, KsLysine, 
LsLeucine, M=Methionine, N»Asparagine ^ 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
_ \°po5sible nucleotide insertion) 


7034 






liPaMPSSCPWRTGALGPSPAGSRALGRCTSSVGPGSRWLTRTSSP 
GCATRTWRTMRMEPRPLRSRMGESAPGIPAELPSAAPSGPSAPS 
AAAPSAPTTPAAAGPWTL*SRRTAEWCWPPSCSCCWGWC*SWSA 
WDWRRPPLQVSPAPSSSCRASCCWCLESIT*SSSTARSRATGAS 
SSSTCPTSRSDRGAAWTP\SPMGAPLLPCSVP]:,ISREEALQDPR 
NPS P *GVCSGSSGHAGIAIjGKPPVACS VP 




92 


1942 


EDTSSMPFRLXilPLGLLCALLPQHHGAPGPDGSAPDPAHYRERV 
KAMFYHAYDSYLENAFPFDELRPLTCDGHDTWGSFSLTLI0ALD 
TLL\TLFYFQILGNVSEFQRWEVLQDSVDFDXDVNASVFETNI 
RWGGLLSAHLLSKKAGVEVBAGWPCSGPLLRMAEEAARKIiLPA 
FQTPTGMPYGTVNLLHGVNPGBTPVTCTAGIGTFXVEPATIiSSL 
TGDPVFEDVARVALMRLWESRSDIGIiVGNHIDVLTGKWVAQDAG 
IGAGVDSyFEYLVKOAILLQDKKLMAMFLBYWKArRUrYTRFDDW 
YLWVQMYKGTVSMPVFQSLEAYWPGXiQSLIGDIDNAMRTPLNYY 
TVWKQFGGLPEFYNIPQGYTVEKREGYPLRPELIBSAMYLYRAT 
GDPTLLELGRDAVESIEKISKVECGFATIKDLRDHKLDNRMESP 
PLAETVBCYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGYI 
ENTEAHPIDPAALHCCQRLKEEQWEVEDLMREFYSLKRSRSKFQ 
KNTVSSGPWEPPARPGTLPSPENHDQARERKPAKQKVPLLSCPS 
QPPTSKIALLGQVFIiDSS * PLDNFFI PrPLRLNYNKLLXiAIIKK 
K 


7035 
7036 


92 


1942 


EDTSSMPFRLLI PLGLLCALLPQHHGAPGPDGSAPDPAHYRERV 
KAMFYHAYDSYLENAFPFDELRPLTCDGHDTWGSFSLTLIDALD 
TLL\TLFYFQILGNVSEFQR\A^EVLODSVDFDlDVNASVPE'nTI 
RWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKLLPA 
FQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEPATLSSL 
TGDPVFBDVARVAIiMRLWESRSDIGLVGNHIDVLTGKWVAQDAG 
IQAGVDSYFEYLVKGAILLQDKKLMAMFLEYNKAIRNYTRFDDW. 
YLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYY 
TVWKQFGGLPEFYKriPQGYTVEKREGYPLRPELIESAMYLYRAT 
GDPTLLELGRDAVESIEKISKVECGPATIKDLRDHKLDNRMESF 
FriAKTVJCYLYLLPDPTWPIHNITGSTPDAVrrPYGBCILGAGGyi 
FNTEAHPIDPAALHCCQRLKBEQWEVEDLMRBFYSLKRSRSKFQ 
KNTVSSGPWEPPARPGTLPSPENHDQARERKPAKQKVPLLSCPS 
QPFTSKLAUC^VFLDSS * PLDNPF I FI FliRLNYNKLLIiAI IKK 


7037 


442 


761 


CLAPLFSCFQIINLHIiAPSGRLRWAWLRGPGRN*LPGEGPSIPT 
RNW*BRKAGCSQPC/PAQQHHGRPPGVSPLPRDPHPTrLRPLPP 
PPPPPPPPPRRPPRKRRPG 




442 


761 


UUiPLFSCPQllNLHLAPSGRLRWAWLRGPGRN*LPGEGPSIPT 
RNW*BRKAGCSQPC/PAQQHHORPPGVSPLPRDPHPTTLRPLPP 


7038 


IBS 


891 


GAGAASDMSSGLRA/^FPRWKRHISEQLRRRDRLQROAFEEIIL 
QYNKLLEKSDLHSVIiAQKLQAEKHDVPNRHEISPGHDGTMNDNQ 
LQEMAQLRIKHQBELTELHKKRGELAQ\RVIDLNNQMQRKDREM 
QMNEAKIAECLQTISDLETECXDIiRTKLCDLERANQTLKDEYDA 
LQITPTALEGKU?KTTEEWQELVTRWMAEKAQEANRLWARE*KR 
LQEAASPAAERACRSSKGTSTSRTG 


7039 


155 


891 


UAGAASDMSSGLRAADFPRWKRHISEQLRRRDRLQRQAFEEill, 
QYNKLLEKSDIiHSVriAQKIiOAEKHDVPNRHEISPGHDGTWNDWQ 
LQEMAQLRIKHQEELTELHKKRGELAQNRVIDLKINQMQRKDREM 
2MNEAKIAECLQTISDLETECLDLRTKLCDLERANQTLKDEYDA 

lqitftalegklrktteenqelvtrwmaekaqe:anrlnarb*kr 
lqeaaspaaeracrs skgtstsrtg 


704 0 


34 


789 1 
< 


<:i.rPPRRPHR(JS5GHGSDNSSVLSGEIjPPAMGKrALFYHSGGSS 
3YESVMRDSEATGSASSAQDST5ENSSSVGGRCRSLKTPKKRSW 
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SEQ 
ID 
MO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptlHi" 
(A=.Alanine, C^Cyateine, D=Aspartic Acid E- 

H-Hi^MH-''''^'^; f-^^^^i^l^l^nine, G=61yci;er 
H-Histidine, l«Isoleucine, K=hys±ne 
L=Leucine, M=Methionine, N=Asparagine, 
PsProlme, Q=Glutaraine, R=Arginine 
S=Serine, T^Threonine, V^Valine, 
WoTryptophan, Y-Tyrosine, X==tJnJcnown, *=stop 
Codon, /^.possible nucleotide deletion 
\=possible nucleotide insertion) 


7041 






i^CiSQRRRl,lPALSi,DTSSPVRKPPNSTGVRWVD(JPLR5SPRGT:G— 
YLEALBCVTERLESRVWFCKAHrj^MITCFDrT 


7042 


1 


567 




7043 


7 


345 


_ _DDLU;4AEVAlSDyGDKLTLB^ i^wbA&w 


1 

7044 


2 

276 


2170 


^tirwU^DSDt:;bKOLVi:;yGTGLEPtBEGERPKKPiPLQDCyrVR^ 
KSVLGPEDPMDEEDLSSFGIAPKAIVTTDDFASKTKDRIREKAR 
^ ^ *^*'*^^'''^-''*^'-ai»J^PG5SEGSEGEDDDYLPDNVt'P 

KYDTVLKDEEPaDOLYGWTapRQYKNQKESKKDLRYVGKlEDSF 
SIJ^KPLSSKKIYPPPELPRDYRPVHYFHPMVAln^CPMourT^,, 

i^esagkatpdpgthsk^;Sk^S^S^sats^^ 

LSOKDKBRIKEMKOATDLKflAtJLKAMIAQIIAQSSMQPSPiAA 
AGHCSWNMALGGGTATLKASNPKPFAKDPEKQKRyDEPLVH^ 
GQKDALERCIJDPSMTEWERCaiERDBFARAALLYASSHsi^Tq^o; 

HPDKLLFQ/RLVGLPRVKRDKYSVPNPLTLPETASLPTTOAS^ 


'>045 ■ 


3 


734 


cvi:i.TDEFAKGRICVAEI.YELVQYAGNIIPKL^JLL.I.TVG\ArtrVKS ' 
FPQSRKDILKDLVEMCRGVQHPI.HGLPLRNYLLQerRNILPDEG 
EPTDEETTGDISDSMDPVLLNFAEMNKLWVRMQHQGHSRDRPKR 
ERERQELRIlVGTNLVRtSQV '^"VRMOHOGHSRDREKR 


7046 


J 


513 


i:^«:!^^^'^^^"^^'^°™PyiTSIflDI.TGQVALYTPCP 

MEFQUffiPFIiYRNASLSIYSIWFYDrarocHRIAKLMADVVEE 
ETRRSQQA/RSGQTESQPGQMLQRPQAHRHPGDAS-nQn/' 


7047 


103 


513 


^rr|«ii^RAGaU»lSLAALKQHDPYITSIAULl'UUVAI.YTFCP 
K^QWEKTDIEGrLPVYRRSAaPYHGPTIVNRLMMffilLVEPVNK 
Si;nf^iiP'!^"'™^^^^SIWFyDKNDC31RIAKLMADWEE 
BTRHSQQA/HSGQTESQPGQMLQREQAHRHPGnABORr,^ 




92 


486 


^ J. y J. <3-*--i-wvyvtCtojS.J\/iijJJ^iAISIjGFHflVYRT? 


7049 




g27 

; 

I 


^^i.rL.L«SWDYRHHA-lRRVISSPV? ^ l-ibUbGKWSSKliEEAmr 
«miAMTYKQRAEOTQBBLREFQEGSREYEAELETQLQQIE^ 

^m-sennrlrmeletikekfevqhsegyrqisaledSaq^ 
:kdqlqkyirei,eqanddlerakratdhglsktpe\qrlh\qm 


r 7050 


393 ■ 

jy3 


938 • B 
\ 

E 

9sa K 


«'ii^Sita:tuupPPG];X3GPA'rXASVAGR(iSSVGKlPARRCYEDEL- 
^EAVGRIYBLRI^IMDFDGKNRGYAPVMYC^^S^I^ 
OTBIRPGRU/SVCCSVDNCRLFIGOIPKMHCREBIL^AK^ 
OTLDVIVYASAADKMKNRGLRLRGVREPPRGCHWLGRKI.IAWX 

KXt;SASrGaPPPUUjGPATXASlrAGRCSSV6KIW>RRCYE0BI. 
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SEQ 
ID 
NO: 


Predicted 
beginning 

nilol(»nt- ■( 
* A tsodoe 

location 

corresponding 

to first 

amino acid 

residue of 

amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
seqiience 


Anuiiu aci.a segment contai.ung signal peptidi" 

CA=Alanine, C^Cysteine, D^aDal-^?r^ a^Tx ^ 
. - . - ' ^---^'S/ i-'^Asparcic Acid, E= 

H=.Hist:Ldine, I=r8oleuci„e, K=Lysine 

P=Proline, Q=eiutandne, R=Argiai„| ' 
S=SBrane, T=Threonine, V=Valine, 

Codon, /.possible nucleotide deletion, 
\=poss).ble nucleotide insei-Unn) 


7051 


119 






7052 
7053 


467 


816 
715 


^^^^^^^^^ 


7054 
7055 


1 
2 


715 
1036 


VQQLRQAIEECKOVILELPEQSEKQKDAVVRLIHS^O^n 

dyrcaecrapi/cs/dgwpsSd??^oSc^^ 


70SS~~ 
7057 


2 


527 

527 


f^^°"'°''*^'^SRASASVLIttNCSLH,I.PMCRTLIAY^G 

sqkvpsrrtrrlldksrtfhitcgaticipsg^Sh^^ISn 
fsvnyssdpvelnaaryrdbdprkllfttvIg^^J^^^ 

EYHYI,H«MIX./ALci:s^^S§^™S2^G 
FSVNYSEDFVESLNAAKVRDBDPRKLLPTrVPoil^I^S^ 


70^8 


1 


431 

r 


^r^';^c^^'"^'''"^'^^"°^°™'«g«^^^^'^HKJJUKLLHASCQA' 
n™^^'^^'^''^°^^'"™^QKVHTGEKPYKCSH™K3SsL 

dkhqrshi.3kkpfs*pvtklsppisisqpshkntqlhSS . 


705S 




4^9 

J 
1 




7oeo ■ 




117S V 
I 

\ 
F 

Q 
T 

1670 s 
Y] 


?PAFPhUPAAAAI>UJAU.GTGPRRAKUCLGAaGPTSijGPAAPTPa' - 
^WARPSAWLECVCWTFDLBLGQALEL^^FR^^^I^t 

M,QREPAHyFGYVYFRQVKDSSVlCRGYFQKSI,VLVSRLPFVRf 
QALLSLIAPEYFDKIAPCLBAVCSEIDQWPAPMG^TiSi; 
WVQVRIPSRVDKSESSPPKQFDQENlipAPVVL^^D)^ 

CPRPVLTHMQTLWELMLLGBPLLVLAPSPDV^^STSa 

LQHWPHILRVGBPKMSGDLPKQVKLKKPPKV.RPMr,TK-n 

;"^f''^"^'^^^^'^'^''^EPLKGSPKAKl]GHIEYKKLWPSQ - 
RFEHUVTOMKMRLOEGRQEAVYQIGVEDHGLLVGLAEEE^ 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide" 
(A^Alanine, C^Cysteine, D»Aspartic Acid, E= 
Glutamic Acid, Fe^phenylalanine, G=Glycine, 
H=Histidine, Islsoleucine, K=I*ysine, 
L=Leucine, M=Methionine, N^Asparagine, 
P=Proline, Q=Glut amine, R=Arginine, 
S==Serine, T=Threonine, VsValine, 

"jrc'^^f A -* *yj-woxi«; , A^unKXiOwn, —Stop 

Codon, /^.possible nucleotide deletion, 
\«poesible nucleotide insertion) 








LKTLHRMABKVGADITVLREREVDYDSDMPRKITEVLVmCVPDN 
QQFLDLRVAVLGKFVDSGKSTLLGVLTQGELDNGRGRARLNLFRH 
LHEIQSGRTSSISFEILGFNSKGEVHGINGTQWGQTLRMGW* * * 
Ki uijijfKvwitijtf ti v*MN/UjRGL*TSSAPliRKSMGNQIiN*IKNG 
VKl KRQGHPGNGLGPGNSEGVGRAGRRH*GPWALGQWNYSDSR 
TAEElCESSSKMITFIDLAGKHKyiiHTTIFGLTSYCPDCALLLV 
SANTGI AGTTRBHIiGLALALKVPFFl WSKI DLCAKTTVERTVR 
QLERVLKQPGCHKVPMLVTSEDDAVTAAQQFAQS PNVTPl FTLS 
SVSGESl»DI*IiKVFLNILPPI»TNSKEQEELMQQLTEFQVDEIYTV 
_ fgytji vvutjT.ijbR*lDiJATLPTQPSPIYSKTSWPKGGDPGI 


7061 


364 


710 


armpsplgppclpvmdpettleepetarlrfrgfcyqevagprb" 
alarlrelccqwlqpeahskeqmlemlvleqflgtlppeiqawv 

RGQRPGSPEBAAAIjVEGLQHDP*ARMPSPLGPPCXiPVMDPETTL 

eepbtarlrfrgfcyqevagprealarlrelccqwlqpeahske 

QMLEMLVLEQFLGTLPPEIOAWVRGQRPGSPEEAAALVEGIiQHD 
PGQLLG 


7062 


71 


744 


AKAGTNLERLkwLSyFFCIPKHKLKSSQKDKVRQFMACTQAGBR 

taiycltqnewrldeatdsffqnpdslhresmrnavdkkklerl 

YGRYKDPQDBNKIGVDGrQQPCDDLSLDPASISVLVIAWKFRAA 
iV'-tfl' ri-«JGNTfiliQCDSMEKLKA^ 

PYQPTFTPAKITPGQKGtiDL*MflGAYWKLVLSGRPKFLYLWNTFI, 
MEKH 


7063 


2 


562 


U;TVPDLPGRRFRAMRTGQRR*PELPPDMMSLE(5SglfLKAFERR 
LTEYIHCLQPATGRWRMLLIWSVCTATGAWNWLIDPETQKVSF 
c iauwiNrti^jj jf I x^t^x liiiVjljj? FAGIHKRWAPSIIAARCRTVLA 

EYNMSCDiyrGKLlLKPRPHVQ*QSSLIVMGLKIAFLRlSDTAKS 
HKGFLIiRLDM 


70S4 


300 


8S4 


i. ^aiipwroo 1. 1 ji^Ailf' X Ao PHPSRGTCPPASS AS 
SRRTGCWTCPPESGHAQARRSRRASASRWGARGAVRSAVAARGC 
SSRAGRWLETPGRRRGP PACAAAAGRLRGPAP * AAPPTAS VPAR 

CRCPAARTGAPAAATWLRRRLSGLRAPALGRRRSPGPSPKSAAP 
PLLTPLGAGRAGGSRANS 


706S~ 


1 


5S5 


ATTTHSARRSGRGAAAEAAASAAGGRQKGPDRKAWEGRRTTPGG 

RSQSEPKAPPPQKRSEAAFASMAHSPVAVQVPGMQNNIADPEBL 

PTKliERTGK'flSP^^Pl/KK'nT'nTJDT'rsAin/BT VT mr otTi-n wi-iT^-rT^T^.i- 
^ i*-«JUivj.^j(X.V7(i3i: VJO V e J\.vHX'JL>lK.lUW V VAXKx X JJijBBAEDEIEDI 

OQE I TVLSQCDSS YVTKYYGS YLKGSKLWIIMEYJU3GGSALDLI* 
RAGPFDEFQ 


706S 


356 


676 


PGPQRGPWRAREGGHPLDPADHPRAPASLRSNVRAATMMQICDT 

YNQKHSLPNAMNRFIGAAraKMDQTVMVPSLLRDVPtiTyOPGLDND 
VG VE VGGfiGG CIi PPPTTJ "O 


7067 


152 


973 


KENITMATEIGSPPRFFHMPRFQHQAPRQLFYKRPDFAQQQAMQ 
QLTFDGKRMRKAVWRKTIDYNPSVIKYLENRIWQRDQRDMRAIQ 
PDAGY YNDLVPP I GMriNNPMWAVTTKFVRTSTWKVKCP VFWRW 
TPEGRRLVTGASSGEPTI/WNGLTFNFETILOAHDSPVRAMTWSH 
NDMWMLTADHGGYVKYWQSNMNNVKMFQAHKEAIREARFIHNIP 

FSWPIVMVKLPSKCILGAEMHGLCQFLGNFLHPINTIFFFVFT 
HSPFCWAPP 


7068 " 


222 


816 


DTMKBYVriLLFIALCSAKPFPSPSHIAIiKNMMLKDMEDTIJDDDD 
DDDDDDDDDDEDNSLFPTREPRSHPFPFDLFPMCPFGCQCYSRV 
VHCSDIiGLTSVPTNIPFDTRMLDLQNNKIKEIKElTDFKGLTSLY 
GLILNNNKLTKIHPKAPLTTKKLRRLYLSHNQLSEIPLNLPKSL 
AELRIHENKVKKIQKDTPKKK 


7069 


114 7 


1765 


FRDHRRYFYVNEQSGESQWEFPDGEEEEEESQAQENRDETLAKQ 
TLKDKTGTDSNSTESSETSTGSLCKESFSGQVSSSSLMPLTPFW 
TLLQSWVPVLQPPLPLBMPPPPPFPPBSPPPPPPPPPAPKMPPP 
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SEQ 
ID 
MO: 


fredictecl 
beginning 
nucleotide 
location 
corre spondi ng 
to first 
amino acid 
residue of 
amino acid 
seofuence 


I'reaicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino ac:a.a segment containing signal peptigT" 
{A=Alanine, C=Cysteine, D^Aspartic Acid; E= 
H ^^^^^^yl^l^nine, G^Glycine, 
H=Histidine, I==l3oleucine, K=,Lysine, 
L=Leucine, M=Methionine M-fl<?nav-arr-{ «^ 
P!=Proline, Q=Glutamine, R=Arginine, 
S-Serine, T= Threonine, V^Valine, 
W-Tryptophan, Y^Tyrosine, X=Unknown, *=stop 
Codon, /^possible nucleotide deletion 
\=possible nucleotide insertion) 


7070 






_ RVSTAQKRIEEWKQQOLVSGMAERigaNPEA ^^^t!.iiD 


7071 


I 


547 


EIJiiaCRKQKKRDAIJV^HEPPPEPEEriGPVDEETFLKAAVE^^ 

AX VUFQ 


7072 


2 
2 


921 


AKGTLRALE'rAKKVGKVGANGQKAAGPSAUfc>VTKMKIGSPPKTP 
VSNVAATSAGPSNVGTELHSVPQKSSPFXTRVPAYPPHSEWIOY 
FQDPRTQIPFEVPQYPQTGYYPPPPTVPAGVAPCVPRFVRSNW 
PESSLPPASMPYADHYSTPSPRDRMWSSPyQPPPPQPYGPVPPV 

SLRERYNSLDGYYSVACQPPSEPRTTVPLPREPCGHLKTSCEEO 
IRRKPDQWAQYHTQKAPLVSSTLPVATOSPTPP.<!'PT.^TOnp^_c. ° 


7073 


50 


921 


AKLrTLi^Ka'AKKyGKVGANGQKAAGPSADSVl^klGSPPKTp-- 

VSNVAATSAGPSNVCTELNSVPOKSSPFLrRVPAYPPHSENIOY 

FQDPRTQIPFEVPQYPQTGYYPPPPTVPAGVAPCVPRPVRSNW 

PESSLPPASMPYADHYSTFSPRDRMNSSPYQPPPPQPYGPVPPV 
PSGN YAP VYDSRR I WRPPM YOP nriT T D ewe T n-nwriT »»*T, « « , 

IRRKPDQWAQYHTQKAPLVSSTLPVATOSPTPPSTT.WPnKrfc. ^ 


7074 


263 


S04 


i^^^ruvSDFPAPAAAPAHTLTSKSGSLSPQFRKPLGRAPAMP 
LVRYRKWI LGYRCVGKTq7.aHnT7in?OTis ot>r^vT^«™^™^ 

""'^ i i^v* V OA i axiAHUFVEGEF S EG YDPTVENT YS KI 

VTLGKDEFHLHLVDTAGQDEYSlLPYSPIIGVHGYVLVySVTSL 
HSFQVI ESLYQKliHEGHGK 


7075 


598 


1003 


vc;FVi.cSTRQKPGH^'SLVTYFGKPTRRKBFLLGHCXAAGKMflrT3 - 

CAL^SGGGVIKAEIENEDYSVTKDGIGI^LENSFSNILLFVPB 
^"^^^ WSIiWTSGLR ITTLSSNL YKRDITSAKV 

kcc^taalefliodmkktrgrlylrpeliaicrprvdiqeennmkS 

AGVFPDRTErj3RKEKLTFTESTHVEI wi:.ii«iv«iitAi, 


70 76 


279 


1005 


MYlNFPFRKEYPPHVQKVEINPVRLSRLQG^HKlMlCKTEESP'Cin " 
AITLNESTGPLLRTSIHQNSGGQKSQNTGLTTKKFYGNNVEK^^ 


7077 




1049 


^Wi^ii^aSNAAEGNbgRHKDKUKSKRGGWSKGRKRKKPLRDSNAPK- 

^nn3f^!^'^^'^^°^^^^^^PPPE^'ri^"^NEWSKLPPEE 

KQRYLDEADRDKERYMKELEQYQKrEAYKVPSRKTQDROKGKqH 

nS^^^^^^^^^^^VFDIPIFTEEFLNHSKAREAEL 

RQI^SNMEFEERNAALOKHVESMRTAVEKLEVDVIQERSRIITV 

LQQHLETLRQVLTSSFASMPLPEXGETPTVDTIDqVM 


7078 




1119 

: 

J 

] 
I 
I 


i»bJ>l^SNSEINGrALRKTDKYGFLGGSQYSGSLKgsiPVDVAROR ' 

SLKWLDMFSNWDKWLSRRFQKVKLRCRKGIPSSLRAKAMQYLSN 
SKE£iLEQNPRKFEELERAPGDP'?CMT.m7TT?vnT tro*^T^»^T,»»«..,* 

«4. *r*i.*\*. &»A^iC(itrtj'tjU£'j\wjjUVIfilCDIjHRQFPFHEMPAA 

KGGHGQQDLYRILKAYTIYRPDEGYCQAQAPVAAVLLMHMPAEO 

™qicdkylpgyysagleaiqldgeiffaimS^ 

^RRQRIDPVLYMTEWFMCIFARTLPWASVLRVWDMFFCEGVKI 

[FRW^VLLRHlXGSVEKIJlSCQGMYETMEeiJW^PQQCMOEDF 

.VHBVTNLPVTEALIERENAAQLKKWRETRGELQYRPSRRLHGS 
lAIHEERRRQQPPLGPSSS oxu^^£> 


7079 


483 


767 I 

\ 


U^WKMAGEQKPSijMI.LEQFtl.LAKGTSGSALtAi:iSQVLSAPQ- 

YVFGELLELANVQELAEGANAAYLQLLNLFAYGTYPDYIANKE 
XiPELY 




2 


376 s 
M 


VViSFKRPKEPSGSDGhSDGPiUVGQEGQLSQMARPDSTPSSSQ 
QARKKRRGIIEKRRRDRINSSL5ELRRLVPTAFEKQGSSKLEK 
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SEQ 
ID 
NO* 


t*redicted 

beginning 

nucleotide 

location 

corresponding 

amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acia segment containing signal pe-ptld^ 

£A=Alanine, C=> Cysteine, D-Asoa*rhir- a^-T^ ^ 
^ ' u-i\sparcic Acad, E= 

« S-^H ^=P^^^ylal^nine, G=Glycine, 
H^Histidme, I«Isoleucine, K=Lysine 
L-I,eucine, M=Methionine, N^AsparagiAe 
P=Proline, Q=Glutaraine, R=Arginine ' 
S-Serine, T=Threonine, V=Valine 
W=rryptophan, Y=Tyrosine, X=UnJcAown, *=Stop 
Codon, /=poesible nucleotide deletion, 
\«possible nucleotide insertion) 


7080 
7081 


200 


595 


AJi VLQMTVDHLKMLHATGGTGTHALLFQASFIQQI F 

VQLPLEAPCi.SbbSCKDHSGGNRDLi,kKHliLlCRVyGSPQDGIPY" 

LTHPLCHODVVSVGRLQIRALATPGHTQGHLVYLt,DGEPyKGPS 
CLFSGDl4LPljSGCGEFPRia?EEI.c;pRf:T?'T'DTroTiA'wrr>T.tri>-r . 


7082 


213 
3 


SOS 


/WTi.BEMILNKl^XCYHNKLIl^PMVRVGTLPMRLLALDYGAD3-- 
^^^j^^^^^^QCKRVVNSVLSTVDPVAPDDRW 


7083 




1137 


- '-"^■vary-nuir' vxijjL,iji<gyijGTNSFXjHGLGQEPPEGAR 6 ti ~ 

CCRSSPRDLRDGEREHEAAQRKAPGAESCPSLPLSISDIGTGCL 

SSLENLRLPTLREESSPRELEDSSGDQGRCGPTHQGSEDPSMLS 

QAQSATEVEERHVSPSCSTSRERPFQAGELILAETGEGBTKFKK 

iiirn.iu.iiHi;k,iii.H5NWGAVPFGKIVGKFPGQIIJlSSFGK0YMLHHP 

ftLBDYVVIJviKROTAlTPPKDINMIMMMDiNPeDTVlEAGSGSG 

VEEWPDNVDFIHKDISGATBDIKSLrFDAVALDMLNPHVTLPVF 
yPHI.KHGGVCPVYWNITQVIBI.LD 


7084 


115 


S41 


K^r.^vui.TKMEVAKlKSLSLLYPKSLSRHVSVRTSVV-rQQLI,SEP 
SPKAPRAHP<MVSTADRSVIlKGlMAySI,Ea5IJ.IJWm)T^MIJ^K 


7085 " 


3 
243 


522 


yKLRKRKTFEDNIRKWRTVISNWlKyAQWBESLKEIQRARSIYE 
RALDVDYRNI TLWLKYABMEMKNRQVNHARNI WDRAITTL 


70B6 




1499 


^vnfr^iS^'^^^^^^P^^^^QQ^^PPYDEMFAAHLRC^ 

YAVGNHDFIEAYKCQTVIVQSFLRAFQAHKEENWALPVMYAVAL 

DLRVFANNADQQLVKKGKSKVGDlynL.EKAAEIiLMSCFRVCASDTR 

AGIEDSKKWGMLFLVKTQLFKIYPKINKLHbCKPblRAIDSSNLK 
DDySTAQRVTYKYYVGRKAMFD5npjrnaT?i7VT ct?a tsctr/^rt, 

KNJO^ILIVLLPVKMLL^^S^^^ 

^^^^^™Q^^VDIDEVQCIIJaiLlYMGHVKG^ 
SHQHQKIiWSKQNPFPPLSTGC 


7087 


256 


525 


SGHLSMEEFKKIYGNFFPYGDASKPAEHVFRTFDANGDGTIDFR 
EF 


7088 


166 


723 


K^^t'^^^^^^'^^^^^^^^^^^^^'^^^J^NVVDKGEGASRG 

NTRKSLEDNGSTRVTPSVQPHLQPIRNMSVSRTMBDSCELDLVY 

VTERIIAVSPPSTANEENFRSNLREVAQMLKSKHGGNYLLFNLS 

ERRPDITKLHAKVLEFGMPDLHTPALEKICSICKAMDTWLNAHP 
HRCRVbHNKG -"ii-nnr 


7089 


104 


759 

i 


^IPKAFATAGWLVSLVLLVFLGFMSFMTTTFVIEAMAAAWAOLHW 
^RMENLKBEEDDDSSTASDSDVLIRDNYERAEKRPXLSVQRRGS 
?NPPEITDRVBMGQMASMFPNKVGVNLFYFCI IVYLYGDIAI YA 
VAVPFSLMQVTCSATGNDSCGVEAnTYVHn^nr^^PT.p^^^ 




33 


1775 i 
i 
S 
I 
Q 
h 
R 

1 ^ 


i^CWKORyJuKARMEESPliSKAPSRGGVNFXJJVARTYIPNTKVEC^ 

lYTLPPGTMPSASDWIGIFKVEAACVRDYHTFVWSSVPESTTDG 

jPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQSPPFQFRE 

LKLQLEGQVTBLRSRVQELBRAIATARQEHTELMEQYKQISRS 
GEITEERDILSRQQGDHVARILELEDDIQTISEKVI.TKEVEI.D 
LRDTVKALTREQBKLLGQLKEVQADKEQSEAELQVAQQENHHL 
LDLKEAKSWQEEQSAQAQRLKDICVAQMKDTLGQAQQRVAELEP 
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SEQ 
ID 

NO; 


Predicted 
beginnincf 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue oE 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amxno acid segment containing signal peptide 
(A^Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G^Glycine, 
r:=aisczaine, Islsoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=^paragine, 
P=Proline, Q=Glut amine, R^Arginine, 
S=Serine, TfeThreonine, V=Valine, 
W=Tryptophan, y=Tyroaine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
Vpossible nucleotide insertion) 


7090 






LKEQLRGAUiSliAASSQQKATLI^EELASAAAARDRTlAELHRSR 

LBVAEVKGKLAELGLHLKEEKCQWSKERAGLLQSVEAEKDKIDK 

LSABrLRLEKAVQEERTQNQVFKTELAREKDSSbVQLSESKREfi 

TELRSALRVLQKEKEQLQEEKQELLEYMRKLEARLEKVADEKMN 

EDATTEDEEAAVGLSCPAALTDSEDESPEDMRLHPMAFVSVETO 
ASLLLGLE 


7091 


33 


1775 


avcWEDRYLK/UCMfiESPLSRAPSRGGVNFLin/'ARTyiPNT^ 

HXTLiPPGTMPSASDWIGIFKVEAACVRDYHTFVWSSVPESTTDG 

SPIHTSVQFQASYLPKPGAQLYQFRYVNRQGQVCGQSPPFQFRB 

PRPMDELVTLEEADGGSDILLWPKATVLQNQLDESQQERNDLM 

QLKLQLEGQVTELRSRVQELERALATARQEHTELMSQYKGISRS 

HGEITEERDILSRQQGDHVARILELEDDIQTISEKVLTKBVELD 

RLRDTVBCALTREQEKLLGQLKEVQADKEQSEAELQVAQQENHHL 

NIJ3X,KEAKSWQEEQSAQA0RLKDKVAQMKDTLGQAQQRVAE]:JEP 

ijKBQIjRGAQELiAASSQQKATLrjGEELASAAAARDRTlAELHRSR 

LEVAEViraKIAELGLHLKEEKCQWSKERAGLLQSVEflEKDKILK 

LSAEILRLEKAVQEERTQNQVFKTEIAREKDSSLVQLSESKREL 

liiliRSAIiRVLQKSKEQLQEEKQELLEYMRKLEARLEKVADEKMN 

EDATTEDEBAAVGLSCPAALTDSEDESPEDMRLHP^4APVSVETQ 


7092 


186 


1076 


BQMLTRBHR^OR3EEQELEPWPSPKKAR5GRWbkNGFKRKMEEP 
EEPADSGQSIiVPVYIYSPEYVSMCDSLAKIPKRASMVHSLXEAY 
ALHKQMRIVKPKVASMEEMATPHTDAYLQHLQKVSQEGDDDHPD 
SIEYGLGYDCPATEGIFDYAAAIGGATITAAQCLIDGMCKVAIN 
WSGGWHHAKKDBASGFCYLNDAVLGILRLRRKFERXLYVDLDLH 
HGDGVEDAPSFTSFCVMTVSLHKFSPGFFPGTGDVSDVGLGKGRY 
YSVNVPIQDGIQDEKYYOXCERYEPPAPNPGL 


7093 


522 


B09 


KQQINEDQEBSQKPRLGEGCEPISKRQMKKLIKQKQWBEQRELR 

KQKRKBKRKRKKLEROCQMEPNSDGHDRKRVRRDWHSTLRLII 
DCSFDXLM 


7094 


454 


655 


wfgvsqvblauoaswvrms fviaacqlvlgllmtsltes^ ions 
ecpqlcvceirpwftpqstyrea 


7095 


2 


508 


FVRSMHWGVut'ASSRPCWDLSWNQSISFFGWWAGSEEPPSPYG 

diiafplqdyggimaglgsdpwwkktlyltggallaaaayllhb 

LLVIRKQQEIDSKDAIILHQPARPNNGVPSLSPFCLKMETYLRM 
ADLPYQNYFGGKLSAQGKMPWIEYWHEKVSGTEFII 


7096 


1 


411 


IA5SLPKMASI.LQSDRVI,YI>V0GEKKVRAPLSQLYFCRYCSbLR " 

SLECVSHEVDSHYCPSCLENMPSAEAKLKKNRCANGPDCPGCMH 

TLSTRATSISTQLPDDPAKTTMKKAYYLACGFCRWTSRDVGMAD 
KSVGE 


7097 


224 


2067 

] 
1 


ETRSLAVQSKPSQAGRRRSSRISFAGALFLTRFLtiQELLLNWFC 
ortno jr.ftjr'UiWU^Af Ai> J. S JjFDL S ADAP VFQGL SLVSHAPGE AIiAR 

APRTSCSGSGERESPSRKIiLQGPMDISEKLFCSTCDQTPQNHQE 
QREHYKLDWHRPNLKQRLKDKPLLSALDFEKQSSTGDLSSISGS 
BDSDSASEBDLQTLDRERATPBKLSRPPGPYPHRVLFQNAQGQF 
LYAYRCV1,GPHQDPPBEAELLLQNLQSKGPRDCVVU4AAAGHFA 
GAIFQGREWTHKTFHRYTVRAKRGTAQGLRDARGGPSHSAGAN 
LRRYNEATLYKDVRDLLAGPSWAKALEEAGTII.LRAPRSGRSLF 
FGGKGAPU3RGDPRLWDIPIATRRPTFQELQRVLHKLTTLHVYB 
EDPREAVRLHSPQTHWKTVREERKKPTEEEIRKICRDEKEALGQ 
^EBSPKQGSGSEGEDGFQVELELVELTVGTLDLCESEVLPKRRR 
RKRNKKEKSRDQEAGAHRTLLQQTQEEEPSTQSSQAVAAPLGPL 
J5EAKAPGQPELWNALLAACRAGDVGVLKLQLAPSPADPRVLSL 
jSAPLGSGGFTLLHAAAAAGRGSWRLLLEAGADPTVQCQDH 




256 


1228 ] 
I 


LRTKSAATWEAWPQCGREGSRIITEPCEANAGSRQELOTERISS 
T^QGDOAPHSGLEXNNSNSELPIiRVGLKVAQGSPLMGGQVSA 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

^ '^■^ i e o^ona X ng 

to first 

amino acid 

residue of 

amino acid 

sequence 


fredicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residua of 
amino acid 
sequence 


^^l"? a^^<^ segment containing signal peptide" 
(A-Alanane, C^Cysteinc, D^AspartL Acidf 
Glutamic Acid, F=Phenyl alanine, G=GlyciAe/ 
H=Hxstidine, I=IsoIeucine, K=Lysine 
L^Leucine, M=MeChionine, W^paragiie, 
P=Proline, Q=Glutamine, R=Arginine, 
S^Serme, T=Threonine, V= Valine 
W=Tryptophan, Y=Tyrosine, X^Unknown, *=stop 
Codon, /^possible nucleotide deletion, 
\-po3sible nucleotide insertion) 


7098 






L^KKSPISHEESRLLQLI^KALPCITRPWIKWSVTQALD^^ 
wnSf!^f'^''^^'^^^^^^^^^«^^^VAClKNlFGDMLCP^^ 


7099 


82 
992 


956 


SSFLKRCRKVLQCWGIPSEOSLFSTIjEEPRDKEIDNYCVMRTriT 

LVLLLQDWQPGGVGICTSFLGISWALLDYHRALRTCLPSKPTT/- 

™'ir'^^'''''^'^^"^^^^^^SALFPSYVALHFLGLWLVL 
LLW\^LQGTDFMPDPSSEWLyRVTVATTr vPQUT7MT7^«/-.«nni^™ 
n * ^v*.— - VAiijjYFSWFNvAEGRTRGH 

AI IHFAFLLSDS ILLVATWVTHSSWLPSGI PLQLWLPVGCGCRP 
_ LGLALRLVYYHWLHPSCCWKPDPDQVD ^^^^^^CFF 


'7100 


20S 


310 


i^D^^o^y^^^^^^Q^^^^^^'^PSGATATWPAASRSRSLA 
ARSLPRSPARPGPNDAIjLRFWmrpr'rkr'irawrtnT:.nt-.««™> 

GAVLEVHVPQIGAGVSLPGILAAKCGAEVII.SDSSELPHCLEVC 
RQSCQMNNIJ'HIOWGLTWGHISWDLIALPPQDIirASDVPPEP 

vr.^?nJt^^''^'^^^^Q^W^^^Q^^WSLEAI,LYKWDM 
KCVHI PLESFDADKEDIAESTLPGRHTVEMLVT q paimer 


7101 


2 


571 


/^jUtiKWEJ^PuaiilVSLPLWVPTASHSKTTALQIGS^FPHLSVL 
DWAFDLTKTNMQTMyEQSEWGWKDREKREEMTnnPiiMVT rnw^v, 

ssvpvafshfrfdvergdevlyJt ^^^"^"^^ 


7102 


2 


503 


WRGGPKUiUaUaAGGAVGWVtLVRGVHiJVk^^ 

VRILLVGEPRVGKTSLlMSLVSEEFPEEVPPRAEEiriPADVTP 

ERVPTHIVDYSEAEQSDEQLHQEISQANVICIVYAVNNKHStDK 


7103 




503 


o^^^^KKHtu<iAGGAVGW \^i.bVKGVHSVRAGGGRPPRAADMKKD 
VRILLVGEPRVGKTSI/IWSUVSEBFPBEVPPRABEITJPADVrP 
ERVPTHIVDYSEAEQSDEQLHQEISQANVICIVYAVNNKHSIDK 
VTSRWIPLINERTDKDSRLPLILGGNKSDLVEYSR ^''^'^^ 


7104 


119 


438 


LSDKGSELKKSFDAWFDVLKVTPEEYAGQITLMDVPVPKAIOP 
DELSSCGWNKKEKYSSAP i^i^i^vfvfkaIQP 


7105 


•Lo /O 


795 


M^!!!^:^!f^^^^^^""°^^^^^^^^^^^^^^^^^vi)ibiNWAGv 

MRCPHWTTEDGFEMQPGVMHLGEAWAGAAPWVQAiriPRRPPKVL 
m^n!!!«y^^^^^^^^^^^^^^^^^°K^^SAPSRIINLSSLA 
HVAGHIDFDDLNWQTRKYNTKAAYCQSXKLAIVLPTKELSRRLQ 
GSGVTVNALHPGVARTELGRHrGIHGSTPLQHHN\WAriLrAAJS 
KSPRSWPAPAQHNTLAVAEELAXVISGKYFDQLKQKAPAPEAED 
EEVARRLWABSARLVGLEAPSVRBQPLPR 


7106 


765 
14 


143 

i 


^WW^-RRPSPKSTijCLSMTCDLP/RGLQDPQCLALFKVAVDKHQA"- 
LLKAAMSGQGVDRHLFALYIVSRFLHLQSPFLTQVHSEQWQLST 
:»w X e- V mijif u VHN YPDYVS SGGGFG PADDHG YGVS YI FMGDG 

^XTFHISSKKSSTKTDSHRLGQHIEDALLDVASLFQAGQHPKRR 
i'RG3GKENSRHRCGFLSRQTGASKRSMTSTnT? 


7107 j 


1145 


1064 C 
Ti 
C 
1 
F 

R 

591 * 


.bQAGHPHPRaAaRIPEADTH^YSKLQRAFDSlV^IKDHKRMFGT 

TRVGFFGSKFGDIJ3EQEFVYKEPAITKLPEISHRLEAFYGQCF 

JAEFVEVIKDSTPVDKTKLDPNKAYIQITFVEPYFDEYEMKDRV 

'YFEKNFNLRRFMYTTPFTLEGRPRGELHEQYRRNTVLTTMHAP 

YIKTRISVIQKEEFVLTPIEVAIEDMKKKTLQLAVAIWQEPPD 

iCMLQMVLQGSVGATVNQGPLBVAQVFLAEIPADPKbYRHHNKL 

LCPKEPIMRCGEAVEKNKRLITADQRBYQQELKKNYNKLKENL 

PMIERKIPEr.YKPIFRVESOKRDSFHRSSFRKrRTor.snnj=| 

I*WJW3TGKKK — 
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SEQ 
ID 

NOt 



7108 



"predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 



7109 



964 



Predicted enST 
nucleotide 
location 
corresponding 
to firat 
amino acid 
residue of 
amino acid 
sequence 



942 



102 



7110 



96 



697 



7111 



7112 



7113 



7114 



7115" 



495 



824 



1492 



7116 



866" 



95 



«mino acid segment contaiimig signal peptide" 
(A=Alanine, C=Cyeteine, D=Aspartic Acid, E« 
Glutatnic Acid, F-Phenylalanine, G-Glycine 
H=Hxstidine, I=Isoleucine, K=Lysine 
L^Leucine, M=. Methionine, N=:Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S^Serine, T^Threonine, V=Valine 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=StoD 
Codon, /-possible nucleotide deletion,. 
N=po33ible nucleotide insert ion) 

VKVAItLLTKrLEgPRTBSEWEN SFTLKMFT.FQFVJ^l^SSTFYIAP - 
FLGRFTGHPaAYT.PT.TwomoT.T?Tr^«r,^/s«* "^^^ 



FLGRFTGHPGAYLRLXNRWRI^ECHPSGCLIDLCMQMGIIMV^ 
QTVmNFMELGyPLIQmWTRRKVRQEHGPERKISFPQWEKDYNL 
QPMNAYGLPDEYLEMlLQPGFTTlFVAAPPIAPLIJai^IEI 
RLDAYKFVTQWRRPLASRAKDIGIWYGILEGIGILSVITNAFVT 
AITSDFIPRLVYAYKYGPCAGQGEAGQKCMVGYVNASLSVFRIS 
DPENRSEPBSDGSEFSGTPLKYCRYRDYRDPPHSI^VPYOYTLQP 



WDQRKRNSLVPGPAHGPAQEEP WKKKJaSLGAAQEAI^IQLQPW 

TQPFPKSEQVYLHFLSWTEDGPEPKDKGSLPQPPITEVESQVP 

fnz^I^^^^^^^^^^'^^^^Q^PK^^RWSPAQEESPRQM 

WIHKEIPTGKKDHECSECGKTPI^SHLWHQRVHSGBKPYKC 

SDCGKTPKQSSNLGQHQRIHTCEKPFECNECX3KAFRWGAHLV0H 

QRIHSGEKPYECNECGKAFSQSSYLSQHRRIHSGEKPFICKECG 
KAYQWCSELIRHRRVHARKEPSH '^^^^i^^CG 

ta.Ui?fFSGFLVEVTKEERHIVKP LyDRYRLVKOMLTHASITPVLG -- 
SPSTKRRGQMLQPIIEGETAHFFEEIKEEEEDGVNLSSELGDML 
r^v^SJf^^^^^^^^'^^^^Q^^^L^^^^SSSRAASMPELLSQ 
LWipuiAEKKKLRKTUlEFEEAFyQQNGRNAQKEDRVPVLEEYRE 
YKKIKAKU^LLEVLISKQDSSKSI vxiiiisxKS 



(jaUi.YRGPTPGGQCIWKPN SMPPDHERHFQF'I'^FALELNELTAg " 
LKRSLPSTDTRLRPDQRYLESGNIQAAEAQKRRIEOUJRDRRKV 
MEENNIVHQARPFRRQTDSSGKEWWVTmJTYMRLRAEPGYGNMD 

PKQ:FPVADRGRLIGGLPDW Ti^aKGKTL«I,TCTVFGNPDPBV^l ^ 
PKNDQDIQLSEHFSVKVEQAKyVS^^TIKGVTSEDSGKySINIKN j 
KYGGEKIDVTVSVYKHGEKIPDMAPPQQ AKPKLI PASASAAGQ 

KCLKOAWHKAPSSIAFTRWC SREERAEGGGNI^RSITRDPKPPcH 

UIPSQRPMDDKKKKRSPKPCLAQPAQAPGTUIRVPVPTSHSGSL 

ALGLPHLPSPKQRAKFKRVGKEKGRPVLAGGGSGSAGTPLOHSP 

LTEyTDVyE^4EGGLLNLIJ^DFHSGRLQAPGKECSFEQr^^VREM 

QBKLARLHFSLDVCGBEEDDEBBEZXSVrEGI.PEEOlCKTMADRNL 

SSGPLNLPRR^'^^^^ 

vy^l:;VD£QIDHYKESQDKFLWQ AAFIGKETIiKDKSGQECKICRKr 

lYLNTDFVSVKQRLPKYYSWERCSKHHLNFLGQNRSYVRKKDDG 

^YWKVCnHYNLHKAQPAERFFDPNQRGKALHQKQALRKSQRS 

QTGBiaYKCTECGKVFlQKANLVVHQRTHraEKPyBCCECAKAP 

SQKSTLIAHQRTHTGEKPYECSECX3KTPIQKSTLIKHQR1«TGE 

KPFVCDKCPKAPKSSYHLIRHBKTHIRQAPYKGIKCTTSSLIYQ 

RIHTSEKPQCSEHGKASDEKPSPTKHWRTHTKENIYECSKCQKS 

FRGKSHLSVHQRIHTGHKPYECSrCGKTFSGKSHLSVHHRTHTG 

EKPYBCRRCGKAFGEKSTIiIVHQRMHTGEKPYKCWECGKAFSEK 

SPLIKHQRIHTGERPYBCTDCKKAFSRKSTLIKHQRIHTCEKPY 

KCSECGKAFSVKSTLIVHHRTHTGEKPYECRDCGKAPSGKSTLI 



MAAHGYNWGLWCMYI IPPQDWLDRGDESAPIR TPAMlGCSPWD 

REYFGDIGLLDPGMEVYGGENVKLGMRVWQCGGSNEVLPCSRVA 

HrERTRKPYNNDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPM 

SNPGVDFGDVSERIiALRQRLKCRSFKWYLENVYPEMRVYKKTLT 

YGEVRNSKASAYCLDQGAEDGDRAILYPCHGMSSQLVRYSADGL 

LQLGPLGSrAPLPDSKCLVDDGTGRMPTLKKCEDVARPTQRLWD 

PTQSGPIVSRATGRCLEVEMSKDANFGLRLWQRCSGQKWMIRN 
WXKflARH 



KVRMRRNAEViEEKIiSMKSWAKFRPGEpWKGYPNIDPETDPYVT 
PGSVTNNLSINTVREVDHLRDRN5GSSSSLNTTLPSTSAWSS IR 



598 



wo 01/53312 



PCT/USOO/34263 



SEQ 
ID 
NO: 


fredicted 
beginning- 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


r Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


^ino acid segment containing signal peptld^ 
<A=^lanane, C=Cysteine. D=Aspartic Acid e« 
Glutarnic Acid, F=Phenylalanine, G^OlyciL 
H=Kxstxdine, l-lsoleucine, K=Lysine 
L=Leucine, M=Methionine, N-Asparagine. 
P^Proline, Q=Glutamine, R=Axginine, 
S=Serine, T=:Threonine, V^Valine 
W=Tryptophan, Y=Tyrosine, X=UnkAown, *=stop 
Codon, /^possible nucleotide deletion 
\=possable nucleotide insertion) 


7117 


695 




LPPKNITAPSRPPPGLTGQKPPLSrWDNSPLRIGGGWGNSD^Y 

tpgsswgbsssgritnwlvlknltpqidgstlrtlcSgp?^^ 


7118 


49 


12^1 


^^^^'''"^^"^^^^^^^^'^"^'yTGAWGKALPAPHMPCAPGALPQGA 
PVSQAARAiPLLQPSQAAQAEGLSOPARAOGALCSLPWPlS 


7119 


49 


1863 


" oo^^''''''''^^^^^^^^^^^"^^^^^^KEVBElSLLQPQV^ ' 
NMIIQSISLLDQLDKDINTFSMRVREWYGYHPPELVKIINDNAT 

MSSDLEETAGSTSIPKRKKSTPKEETVNDPEEAGHRSGSKKKRK 
_FSKBEPVSSGPEEftAGKSSSKKKKKrHKA.^r.Fn ^^^^^^K^K 


7120 


1991 


1863 


BMIIQSISM.DQLDKDINTPSMRVRBWYGYHPPELVKIHroHAT 

MDlSAlDLINIESPSSRWSLSEYRQSLHTYLRSKMSQVAPSli 
ALIGBAVGARLIAHAGSLTNLAKYPASTVQILGAEKRLPRALKT 

sevptsvfgeklreqveerlspyetgeiprknldvmkeamvqS 

BAAAEITRKLEKQEKKELiCKEKKRLAALAIASSEKSSSTPEECE 
FSKBEPVSSGPEERAGKSSSKKKKKFHKASOED 


7121 




64 

1 
i 
h 

2 
A 


^^'"'"'"'"^'^^'^Q'^^^^^^PRglEPOTANI.SWFKDS- 

RAEAMMRSSIERGKWVFFQNmiAPSWMPALERLIEHINPDKVH 
RDFRLWLrSLPSKKPPVSILQNGSKMriBPPRGVRANI.LKSYSS 
LGEDFLNSCHKVMEPKSLLLSLCLFHGNALBRRKFGPLGFNIPY 
^^^^''"^^^^^^'^'^^"'^^'^IPYKVLKVTAGBINYGGRVTD 
aWDRRCIMNILEDFYNPDVLSPEHSYSASGlYHQIPPTYDLHGY 
* ufTULnui ixrnt i r (jijttiJW ANITFAQlTETFAliLGTI lOLOPK 

L^^.f^^^^^^'^'^^^^^^^^SP^^^^KYPVLyEES 

PAVFWISGFFFPOAPLTGTLQNFARKFVrSIDTISFDFKVMFE 

.VIWLLPTPNRKAQDQDFyLCPIYKTLTRAGTLSTTGHSTNYVI 
VEIPTHQPQRHWIKRGVALICALDY v:'«c»iiN i vi 




2 


545 K 
L 
S 
G 


i/bKPWVI,SLGaMVGLMTYGRRQFQSLDTTMKRLlPPFREASAK"~ 
TTLVDADAEAPTAYLEAMRLPKNTPEEKDRRTAALQEGLRRAV 
VPLTIAETVASLWPALQBLAROGNLACRSDU3VAAKALEMGVF 
amm^Ijy^ITDEAFKDQIHHRVSSLLQEAKT 
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SEQ 


iHrediehorf ' 1 Ik !j _ 4 


ID 
NO: 


beginning 
nucleoUide 
location 
corresponding 
to first . 
amino acid 
residue of 
amino acid 
sequence 


r-j:euiccea enc 
nucleotide 
location 
correspondinc 
to first 
amino acid 
residue of 
amino acid 
sequence 


H.Histidine, I=Isoleuclne, K-r,ysins, 

c;I^ 7^"'.^:'^°^^"^' X=Unknown, -stop 
codon, /=possable nucleotide deletion 
N^ossible nucleotide inserting, 


7122 
7123 


1 


546^ 




7124 


2 


1092 


j^^^OT^XQ«^p;^..«,^^J^^ 


7125 


lb& 


382 


tx,c;t,''''^^-^^^^^"PQSPCrtliP(JPALSPGTLGgSgw^ 


7126 


1 


1127 


TO|ljaiU..ll;|dlU.KltKlllKI,S'SLa^,- 


7127 


1311 


733 




7128 




277 








5228 j 

1 

N 
L 
G 

S 

c 

R< 
TI 

i 

KV 
I OP 
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SEQ 
ID 
NO: 


i^redicteci 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


^u.xno acia segment containing sxgnai peptlSi-H 
CA=Alanxne, ^Cysteine, D=Aspartic Ac?df E- 
f w^!^-!.'''''*'' ^^Phenylalanine. G^GlyciAe 
H=Histxdxne, I^Isoleucine, K^Lysine, 
L^Leucine. M=Methionine, N^Asparagine, 
P=Prolxne, Q^Glutamine, R=Arginine. 
S=Serxne, T=Threonine, V= Valine 

r^ri^J^^/^^^"'.!;^^'''*^^^^' X=Unk;own, *=stop 
Codon /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


7130 


2 


" 10S4 


SPLFGTMPPSETSEPSILETCALlMKIICLEiyYVVKGS^P 

PASVNCLRI^MKCTLIilLLRQWKREIKSSVDElSTEiLEC 
CETLQEBVIALFDQTRHSIAI/SSATEDKDSMETDDCSRSRHRD^ 
SIJIMKQNLHFTEATLHI^TIJ«TQQGATAVAGAQITQS^PL 

lsvyqls™gtaqtpsasrkslda;5wpgvyr£2msSSkt 

LRWLPBAia3FVGVHQERTWCIiaVRTVQSLACLSEA§mVG 
^^^:2'''™°^'''^^^\^°^^\«^PSAAS^PSsSS 

EyNFLPALSFTTPTFMEVAPSFGTLIJ^TVNVAi^NMLGELDK^ 
EPLTQAVGLSTC3AEGTRTLKSLLMFTMENC?^SS^ 

pavkprdkormkoelsselstllssi^ryprroapssSotp 

^OGKSrSLSKASPESQEPLIOLVOAFVpSSnp ^^""^"^ 
I'KKi-JtWRRRLH^A^PASSAWSPGIiiASGTM^tjkl.uuAJiMtKTritto 
WDQSTFIGRAiraFFTVTDPRNIIJ,INEQMSARKIVHDYRMIV 

ppgltenelwhakyiydsafhpdtgekSSSto^^t 
tocmmtfvrttpavlfwqwinqsfi«.vvnySSS 

GTAWSATTGAVATALGUJALTKHVSPLIGRFVPFAAVAA^ 

►WAPGMAIPPFIfmTLEmFLKRFPWMSAPIQVGLVGPCLVFA 
TPLCCALFPQKSSMSVTSLEAELQAKIQESHPELRRVYF^r 


7131 


80S 


780 


lZ^og;^c^^°^'^°^^^^"^^^"^^'^^PH^V^^SG- 
ISAKSQPYHRSQSSSSVLINKSMDSINYPSDVOKQQLLSljffiSS 

AQIATQNYFSNFKETDGDEDDYVEIKSEEDESELELSHNRRR^ 

dskftoadfsdhvcsgnti.hslnsprtpkkpwskSS^ 

VNDSDKLtJDYLWHGPSPNQQNIVQSl^KFOCI^RSSP* 


7132 


1420 




^ r^nv;^^r''^^^'^^'^"^^*'^">^"°""^"AVO^NHt.EW- 
KLL0DYQDSYTI.SETOAEAAAEAT...«!1fBWr.Rcv»r 


7133 


2 


1087 


ir'rI^i;io'^"'^^'^^^"^^*'^^^''^"l^^l«'«NAHALSTVTA " 






3648 ( 

^ 

£ 
F 
A 
I 
E 
R 
S 
R 


JPQWVLVPEMDVai.rRSASFSRKWSSSKTSSGSQALVlRSRL 
iLPEMVGHPAFAVIPQLEYVFSSPAGVDGNAASWSLSOTA™^^ 
ITOmiA^PLLEMSGRVTLPLQGGIQPNPSHCLVYKVpiS 
EEVKQVESGTLRFQFSLGSEEHLDAPTEPVSGPKVERRPSRKP 

tspssppapvprvlaapqnspvgpglsisqlSr^S 

^PTSQLPHGSQASPAQAQEFPI^GISIttEADLsSLETS 
.,^»°^^^^*''^^"™PQJ®=SDCI,QSNEMVLQFLAFS 

nAVQTLQIDVWDGDSLLLTG.qnav9MTmTT.pr,r.pry^Vnr^^TTIT 
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SEQ 
ID 
NO: 



predicted 

beginning 
nucleotide 
location 
corre spondi ng 
to first 
amino acid 
residue of 
amino acid 
sequence 



Freiiictecl end 
nucleotide 
location 
corre sp ond i ng 
to first 
amino acid 
residue of 
amino acid 
sequence 



7134 



2115 



1111 



7135 



7136 



7137 



7138 



~7139 



418 



466 



7140 



"1401" 



1357 



-mxno acid segment contaiiixiiy i,agnal peptide- 
(A^Alanxne, C^Cysteine, D^Aspartic Acid E- 
Glutamic Acid. P= Phenyl alanine, G^GlyciAe 
H=Histidane, I=Isoleucine, K=Lysine/ 
L=l.eucine, M^Methlonine, N=Asparagine, 
P^Proline, Q=Glutamine, R^Arglnine, 
S=Serane, r=Threonine, V-Valine 

Codon, /.possible nucleotide deletion 
\^po3sible nucleotide insert- i r>r. \ 



\^po3siJ^ie nucleotide insertion) 




JVFR 
'NV 
lTPT 



PQELKTDPKGWVLPPRGVQDUWCSVRPli^sSvMj^^^m 

WGGLSTRSEMCLSYLLVYPRINLTRCASIPDIMEQ^^^ 
YRPVTTWPFIlKSPKQYKNbSEWDAMNKFKWTKKEG?^ 



)RAEMT 
IGVKEI 




sslhrdfsini,lvciiH,lscti>5tksl' 



sgkitsgebviaslsghwdrdvpikekgsgssalfwtpsgS^^ 

_EEaQRQHARERQESLMPWKPQI.PHLDPITOB^o^^^*^'^ 



LroiAQFEQMILRTI^EAVARQTTPLGSPGPRHERSQPMRL 
RKMDQPSGHSQATESSGSTPESCPBLSDEEQ^^reG^rc 
PrPGLLQSPRSWgLI.CVF IACr.T.p-n^TT.^ «™^=.TARAAQA 
UfVPSFRRPS Cal 'l 'iSQTVWLI.RAAri.KKteOar.^ L,...,. . 



"""v-J«-ivj»vrui4>.Vf I>flCQLFINHTT.g 

i>>vPSFRRPS Cal 'l 'jiQ-lvWLI.RAaTLt;Kli</AGtREKlHHLDDMTV 

sqqrkvrqmieolqnskaviqskdatiqelSk?^^^^^^ 
^r^liekqishgnfstxjaraxtenp^gsSpSJ;^ 

^^^^^^^^^ 

WA^u«,aTVPGG SkHSLC?XQVRGGWGVTGGBEBSLTVPVADTWOA 



^ , *'-'^^V"J-XCK J::'tUjl:'QXSDDEKKDF 

«^aMJVI,KAW (jUi,n.3SFQQQHl- G mMLESI,FDLKVYDCPg.;y 
j!»gJ^gLEKQLRPSQPWPRGKQtKTggB«K^gp^o^^ 
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1 SEQ - 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Jr'redlcteci ^nA' 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


acxa seg.ueuL containing signal pepfiHi-l 
(A^lanme, C=Cysteine, D^Aspartic Acid fif^ 
H-Hi^tidin^ ^.Phenylalanine. otGlylte 
H-Histidane, I-Isoleucine, K=Lysine 
L^Leucine, M.Methionine, N^AsparagiAe, 
P=Prolxne, Q^Glutamine, R^^Arginine 
S=Serine, T=Threonine, V=Valine 

Codon, /.possible nucleotide deletion, ^ 
\-possable nucleotide insertion) 


7141 


124 






7142 


658 


1073 




7143 




839 




7144 


1 


773 




TP APinricii 




988 

( 
1 


RCPAPRPAGVSYVIRDEVBKyNHNGVNAI^LDPALNRL^Ar^ 
DTTVK7MNAHKC3F<™STLRTHKDWKaLAyAKDSASAGfnp 



TRADOCS:I416260.1(%CSKOI!.DOC) 
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WHAT IS CLAIMED IS: 
1. An isolated polynucleotide comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO:l-1786 and 3573-5358, a mature protein coding portion 
of SEQ ID NO:l-1786 and 3573-5358. an active domain of SEQ ID NO:M786 and 
3573-5358, and complementary sequences thereof 

2. An isolated polynucleotide encoding a polypeptide with biological activity 
wherein said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent' 

hybridization conditions. 

3. An isolated polynucleotide encoding a polypeptide with biological activity 
wherem said polynucleotide has greater than about 90% sequence identity with the' 
polynucleotide of claim I . 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 1. 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 
operatively associated with a regulatory sequence that modulates expression of the 
polynucleotide in the host cell. 

10. Anisolatedpolypeptide.whereinthepolypeptideisselectedfromthegn)up 

consisting of: 
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(a) apolypeptideencodedbyanyoneofthepolynucleotidesof Claim 1; and 

(b) a polypeptide encoded by a polynucleotide hybridizing under stringent 
conditions with any one of SEQ ID NO: 1-1786 and 3 573-5358. 

11. A composition comprising the polypeptide of claim 10 and a carrier. 

12. An antibody directed against the polypeptide of claim 10. 

■13. A method fordetecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a 
complex with the polynucleotide of claim 1 for a period sufficient to fomi the complex- 
and ' 

b) detecting the complex, so that ifa complex is detected, the 
polynucleotide of claim 1 is detected. 



14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions 
with nucleic acid primers that amieal to the polynucleotide of claim 1 under such 

conditions; 

b) amplifying a product comprising at least a portion of the 
polynucleotide of claim 1; and 

c) detecting said product and thereby the polynucleotide of claim 1 in 

the sample. 



15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the 
method further comprises reverse transcribmg an amiealed RNA molecule into a cDNA 

polynucleotide. 

16. A method for detecting the polypeptide of claim 10 in a sample, comprising: 
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a) contacting the sample with a compound that binds to and forms a 
complex with the polypeptide under conditions and for a period sufficient to form the 

complex; and 

b) detecting fonnation of the complex, so that if a complex fonnation 
IS detected, the polypeptide of claim 1 0 is detected. 

17. A method for identifying a compound that binds to the polypeptide of claim 10. 
comprising: 

a) contacting the compound with the polypeptide of claim 10 under 
conditions sufficient to fonn a polypeptide/corapound complex; and 

b) detecting the complex, so that if the polypeptide/compound 
complex is detected, a compound that binds to the polypeptide of claim 10 is identified. 

IS. A method for identifying a compound that binds to the polypeptide of claim 10, 
comprising: 

a) contacting the compound with the polypeptide of claim 10, in a 
cell, under conditions sufficient to fonn a polypeptide/compound complex, wheiem the 
complex drives expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence 
expression, so that if the polypeptide/compound complex is detected, a compound that 
binds to the polypeptide of claim 10 is identified. 

1 9. A method of producing the polypeptide of claim 10. comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected 
from the group consisting of a polynucleotide sequence of SEQ ID NO:M786 and 3573- 
5358, a mature protein coding portion of SEQ ID NO:1.1786 and 3573-5358. an active 
domain of SEQ ID NO:l-1786 and 3573-5358. complementary sequences thereof and a 
polynucleotide sequence hybridizing under stringent conditions to SEQ ID NO:I-1786 
and 3573-5358. under conditions sufficient to express the polypeptide in said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 
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20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of any one of the polypeptides SEQ ID NO:1787 -3572 and 5359-7144 
the mature protein portion thereof, or the active domain thereof. 

21. The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 

array. 

22. A collection of polynucleotides, wherein the collection comprising the sequence 
mformation of at least one of SEQ ID NO:l-1786 and 3573-5358. 

23. The collection of claim 22, wherein the collection is pn>vided on a nucleic acid 
array. 

24. n.e collection of claim 23. wherein the anay detects full-matches to any one of 
the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any one of 
the polynucleotides in the collection. 



26. The collection of claim 22, wherein the collection is provided in a computer- 

readable format. 



27. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 1 0 or 20 
and apharmaceutically acceptable carrier. 

28. A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that specifically 



binds to a polypeptide of claim 10 or 20 and a pharmaceutically acceptable 



earner. 
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considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious :> a person skilled in the art 

document member of the same patent family 




Form PCT/ISA/210 (second sheet) (July 1998) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/USOO/34263 



Box I Observations .vhere certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This international tepon has not been established in respect of cenain ctoims und er Article 17(2)(a) for the following . 
[ I Claim Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



Claim Nos.: 

becaiBe they relate to pans of die international application that do not comply with the prescribed reouirements to such 
an extent that no meanmgful international search can be carried out. specifically; ^""^'"^ requirements to such 



□ 



Claim Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third 



sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 

I^M.T'T' S^^hing Authority found multiple inventions in this international application as follows- 
This includes 4 invention Groups and 3572 sequence species -ppircanon. as toiiows. 



^arcLTSm?"'""'' '''' ""^ ""^ ""^ "''^"^^ "=P°" ^^'^ ^" 

covers only those claims for which fees were paid, specifically claims Nos. : 



^ IV^^lT additional search fees were timely paid by the applicant. Consequently, this interna.ional search report is 
restricted to the mvention first mentioned in the claims: it is covered by claims Nos : 

Remark on Protest □ The additional search fees were accompanied by the applicant's protest. 

I — I No protest accompanied the payment of additional search fees. 
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I »vemiv?^SSrPCTiuM - «> 'in^^- as to form a single 

Group I. Claims Ml. 13-16. and 19-26 ZZT^JmSl^J^I^Z^ ^ «PP™P"«« ''"Ikio™,! serch fees mus, be paid 
acids, polypeptides, method^ ofrrl^sSriS^SSZ^^'/.^'T »>'<='^c'^ 
Group H, claim 12 and 28. drawn to aSSStod ofZ^^^^ ""^""'^ "''"^ Po'XPeptWes. 

I u oup i V. Clam. 27. drawn to method of treatmeru usmg composition comprising poljwtWes. 

polypeptides expressed there^. velto^and SsSSS« sal f^Z!?-^ ^^"i^ ' encompasses nucleic acids. 

of use of .his jubject matter. G^ups I^V a^are dTrlt^o Z ZsS2|^^^^^ "'"''"''"8 as we.! as the first method 

directed to an amibody and method of treatment usins same whi^ ^ ^ summarized as follows: Group II is 

I what is bound Is different from what is Si tte^iiuS^of G^^^ "'""'"^ 

the polypeptides of Group I as the antibody ofGrZT^r^mr^TaV.f ^Lt"""^'' P°'yP*P"*» G""-? I do not bind 

special technical features from detection. Group HI fs diS » tJeSifi™ „n ."^ «««'«^«« 

identified in any of the other Groups and thus c^eartt S^i« fwn s^2f,^h„^ ? r'"*"* ''^ " P^'VP^""*- ^hich is not 

which is a clearly different methods than the mSrthrotter GroSr- ^"'"P ^ '° 

I different special technical features and thussup^Zis lirof unjy "^-^^ °^ ""^ » 

I i^in^/nSU^r.?^^ "« generic invention. Tltese species a« deemed to lack unity 

species .0 be searched. *e apptop"««^i«o^trrchS musX;^^^^^ ^ ^"'l."- ""^^ ' 

polynucleotides ami the polypeptides encoded S as r«r3f b^rt I^. "«='"<to " series of 

of these polynucleotide sej^n^ encodes ZS^^i^S^Zi^r^t^ ^ Nos: 1-1786. and 3573-5338. Each 

I *fines its own special technical feature. In su.nm«t^StTs a ^^^^ Therefore, each of these genes 

polypeptide sequence encoded thereby. represented by one polynucleotide sequence and oiTe 



